Cavitation Effect on the Discharge
Coefficient of the Sharp-Edged Orifice Plate


R. OBA
Graduate Student,
Tohoku University

The object of this paper is to investigate the effects of cavitation on the
discharge coefficient of sharp-edged orifice plates with reference to various
degrees of cavitation as defined by a cavitation number. The experimental data
described in this paper substantiate the fact that cavitation can exist down to
a minimum cavitation number of 0.2 without introducing errors in the orifice
discharge coefficient in excess of the normal expected accuracy. In addition to
this, it was found that the use of air-inhalation to suppress the vibration and
noise from the cavitation had no effect on the discharge coefficient.


Introduction 


While it is desirable to use sharp-edged orifice
plates for water flow measurement, there has been some concern
in the use of this device where cavitation is present, particularly
in regard to small size pipes with relatively high velocities. While
J. F. Bailey [1]² and G. Ruppel [2] have made investigations regarding
cavitation, the former experimenting with high water
temperatures of 196.9 F and 209.1 F and the latter experimenting
in large Reynolds number ranges, neither of these papers dealt
directly with the cavitation effects on the discharge coefficient of
the sharp-edged orifice plate. To date, no literature dealing
directly with the stated problem is known to us.
It is the intent of this paper, therefore, to investigate and
clarify the effect of cavitation occurrence on the discharge coefficient
of sharp-edged orifice plates and to set allowable limits
for cavitation in order to remain within acceptable flow measurement
standards. The experiment described in this paper covered
the range of cavitation numbers K_d down to 0.2, showing that
no significant errors in flow measurement exist over this range of
cavitation occurrence in the orifice sizes tested. It was also
shown that air inhalation, used to suppress the vibration and
noise due to cavitation, does not show a significant effect on the
orifice discharge coefficient.


Orifice and Orifice Piping 


The orifice flange used in the experiment employed corner taps 
conforming to the Japanese Engineering Standard [3] shown in 
Fig. 1. The dimensions of the JES flange and orifice assembly 
are quite similar to those used by the ASME [4] and VDI [5]. 
Five different-diameter orifice plates were used in this experiment, 
as tabulated in Table 1. 

The flange and orifice were installed in the piping arrangement, 
as shown in Fig. 2. Straightening vanes of 12 mm square lattice, 


1 English translation edited by D. M. Stough, Senior Meter Engineer,
Hagan Chemicals & Controls, Inc., Pittsburgh, Pa.

2 Numbers in brackets designate References at end of paper.

Contributed by the Research Committee on Fluid Meters and
presented at the Annual Meeting, New York, N. Y., November
30-December 5, 1958, of The American Society of Mechanical
Engineers.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, March 8,
1957. Paper No. 58-A-93.
Table 1 Dimensions of specimen pipe and orifice plates

Pipe diameter   Orifice diameter   Diameter ratio
D, mm           d, mm              β = d/D
104.10          23.33              0.224*
                46.69              0.448
                51.94              0.499
                61.73              0.593
105.26          23.33              0.222
                46.69              0.444
                66.60              0.633*

* The orifice diameters were selected for even values of m = (d/D)².
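
The footnote to Table 1 can be checked with a few lines of arithmetic. The sketch below recomputes the diameter ratio β = d/D and the area ratio m = (d/D)² from the tabulated diameters. The function and variable names are illustrative only and do not come from the paper; small differences in the third decimal place against the printed ratios are to be expected from rounding.

```python
# Pipe and orifice diameters (mm) as listed in Table 1.
TABLE_1 = [
    (104.10, 23.33), (104.10, 46.69), (104.10, 51.94), (104.10, 61.73),
    (105.26, 23.33), (105.26, 46.69), (105.26, 66.60),
]


def diameter_ratio(D_mm, d_mm):
    """Return the diameter ratio beta = d/D."""
    return d_mm / D_mm


def area_ratio(D_mm, d_mm):
    """Return the area ratio m = (d/D)**2."""
    return diameter_ratio(D_mm, d_mm) ** 2


for D, d in TABLE_1:
    beta = diameter_ratio(D, d)
    m = area_ratio(D, d)
    print(f"D = {D:6.2f} mm  d = {d:5.2f} mm  beta = {beta:.3f}  m = {m:.3f}")
```

The computed values of m cluster near 0.05, 0.20, 0.25, 0.35, and 0.40, which is what the footnote means by "even values."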


Fig. 1 Orifice plate and pressure taps


0.8 mm thick, were employed 25 diameters upstream from the
orifice to produce desirable metering conditions. The distance
PO was varied to produce the required total straight run of pipe
for the different-diameter-ratio orifice plates, in accordance with the
following tabulation:

PO = 30 D for β ≤ 0.499
PO = 39 D for β = 0.593

The orifice plates were made of bronze, and a 15-cm length of
clear plastic pipe immediately followed the orifice flange for purposes
of observation. The water was pumped by centrifugal
pump C through the flow section to a tank containing a right-angled
triangular notch weir T suitable for flow measurement.
Fig. 2 Experimental installation


Method of Test 


A U-tube mercury manometer was used to measure the pressure
difference across the orifice plates at taps m1 and m2, with a
measurement of static pressure at tap m2. It was found necessary
to vent frequently the air accumulated in the piping by
means of vent cocks located at taps a1 and a2. The rate of flow
through the orifice plates was measured by means of the right-angled
triangular notch T, which had been previously calibrated
by means of a volume tank at the Institute of High Speed
Mechanics of Tohoku University. The rate of flow and static
pressure were regulated by means of valves v1, v2, and v3.

The formation and size of the cavitation were observed and
photographed throughout the experiment by means of the plastic
section immediately following the orifice flange. These photographs
are shown in Fig. 3 for various values of K_d [6], taken at a
flash duration of 1/100,000 sec. The photographs show the
cavitation occurrence much more clearly than is possible with the naked eye.

Fig. 3(a) Flash duration 1/100,000 sec; t_w = 21.6 C; β = 0.444; K_d = 1.50

The discharge coefficient K, the cavitation number K_d, and the
pipe Reynolds number R_D were calculated from the following
formulas:

$$K = \frac{Q}{\dfrac{\pi}{4}\, d^{2} \sqrt{2g\,(P_1 - P_2)/\gamma}}$$

where

Q = flow rate through pipe, ft³ per sec
d = orifice diameter, ft
g = acceleration due to gravity, ft per sec²
P_1 = static pressure at upstream tap, ft
P_2 = static pressure at downstream tap, ft
γ = specific gravity of water, dimensionless

$$K_d = \frac{2g\,(P_2 - P_d)}{V_0^{2}}$$

where

P_d = vapor pressure of water, ft
V_0 = average velocity through orifice, ft per sec

$$R_D = \frac{V D}{\nu}$$

where

V = average velocity through pipe, ft per sec
D = inside pipe diameter, ft
ν = kinematic viscosity of water, ft² per sec

Fig. 3(b) Flash duration 1/100,000 sec; t_w = 21.6 C; β = 0.444; K_d = 1.10

Fig. 3 Moment photographs of appearance of cavitation occurrence

The point I shown in Fig. 2, located 14 cm downstream from
the orifice plate, is a 6-mm-diam hole drilled in the pipe to permit
air inhalation. This hole was used during certain portions of the
experiment to suppress vibration and noise due to excessive cavitation,
and will be discussed later.


Fig. 3(c) Flash duration 1/100,000 sec; K_d = 0.90

Fig. 3(d) Flash duration 1/100,000 sec; t_w = 21.6 C; β = 0.444

Fig. 3(e) Flash duration 1/100,000 sec; t_w = 21.6 C; β = 0.444; K_d = 0.30
Fig. 4 Relation between incipient cavitation number K_di and downstream pressure P_2
(ordinate: K_di, incipient cavitation number; abscissa: P_2, downstream pressure, cm Hg abs.;
legend: β = 0.593, 0.499, 0.448, 0.224; t_w = 7.7-15.2 C; a/a_s = 1.04-1.05)


Manner of Cavitation Occurrence 


The relation between the incipient cavitation number K_di, defined
as the cavitation number at which small bubbles of cavitation are
seen only intermittently, and the absolute static pressure P_2 at
the downstream pressure tap m2 is illustrated in Fig. 4. As shown
in Fig. 4, the incipient cavitation number is approximately 2.5
irrespective of the orifice diameter ratio, β.

The symbol t_w is the water temperature, a is the absolute air
content of the water, and a_s is the saturation air content of the
water. It should be noted that the ratio a/a_s is normally very
close to one, and that changes in this ratio affect the incipient
cavitation number only slightly and have no particular effect at
cavitation numbers below the incipient value.

As a matter of convenience the observed conditions of cavita- 
tion with respect to cavitation number have been tabulated in 
Table 2. 


Table 2 Manner of cavitation occurrence

Cavitation
number K_d    Manner of cavitation occurrence
2.5           Small cavitation bubbles are seen by the unaided eye intermittently
1.0           Small bubbles float in the dead space
0.6           Cavitation occurs from the orifice edge; somewhat unstable flow; large noise
0.25          The pipe is almost filled with cavitation bubbles; very unstable flow; large noise
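
For quick reference, the observations of Table 2 can be wrapped in a small lookup. The sketch below treats each tabulated cavitation number as the value at or below which the corresponding condition has been reached. That threshold reading is an assumption made for illustration, since the table lists representative values rather than sharp boundaries, and the function name is hypothetical.

```python
# (cavitation number, observed condition) pairs transcribed from Table 2,
# ordered from mildest to most severe.
TABLE_2 = [
    (2.5, "small cavitation bubbles seen intermittently by the unaided eye"),
    (1.0, "small bubbles float in the dead space"),
    (0.6, "cavitation from the orifice edge; somewhat unstable flow; large noise"),
    (0.25, "pipe almost filled with cavitation bubbles; very unstable flow; large noise"),
]


def observed_condition(Kd):
    """Return the most severe Table 2 condition reached at Kd, or None above 2.5."""
    condition = None
    for threshold, description in TABLE_2:
        if Kd <= threshold:
            condition = description  # keep overwriting with the more severe entry
    return condition


for Kd in (3.0, 1.5, 0.8, 0.4, 0.2):
    print(Kd, "->", observed_condition(Kd))
```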


Variation of Discharge Coefficient With Changing Values of 
Cavitation Number 


A study of the orifice discharge coefficient K was made with
respect to different values of cavitation number for four
different-diameter-ratio orifice plates. This study is shown in Fig. 5. It
can be seen clearly in Fig. 5 that no change in the discharge coefficient
was apparent between the noncavitating and cavitating
flow conditions down to a cavitation number of 0.2. Below line L
the noise and vibration from the cavitation became quite apparent.


Variation of Discharge Coefficient With Changing Values of 
Reynolds Number 


The orifice discharge coefficients K are plotted against pipe
Reynolds number R_D for various orifice-diameter ratios as shown
in Fig. 6.

It is clear from this figure that the variations of the discharge
coefficients with changing values of Reynolds number are smaller
than the allowable tolerance of the standards in this experiment.
Table 3 Comparison between standard and experimental values of
discharge coefficient K

                  Discharge coefficient K
Diameter ratio    Experimental    Standard
β = d/D           value           value
0.224             ± 0.002         0.598
0.448             ± 0.003         0.615
0.499             ± 0.003         0.623
0.593             ± 0.003         0.647
0.222
0.444
0.633


When cavitation occurs, even violently (K_d = 0.2), the
dispersion of the experimental values obtained is less than
±0.5 per cent. Therefore the variation of the discharge coefficient
is smaller than the allowable tolerances stipulated by the
following standards:


Diameter 
ratio 


Allowable 
tolerance 


±0.5 per cent
±1.0 per cent

±0.5 per cent

±0.5 per cent
±1.0 per cent


Standard 
JES 


Type 
Corner taps 
Flange 


Corner taps 
Corner taps 


ASME 
DIN 


Table 3 shows a tabulation of the discharge coefficients obtained
experimentally for the various diameter-ratio orifice plates
tested. This table also lists the DIN standard coefficients for the
same diameter-ratio orifice plates. It will be noted that the discharge
coefficient for the 0.593-diameter-ratio orifice plate is approximately
0.4 per cent higher than the standard value, which
is attributed to a formation of rust in the upstream orifice piping,
appearing before this orifice was tested. Table 3 substantiates
that the test data for the various orifice plates agree with the
standard DIN data within the allowable tolerance of ±0.5 per
cent.
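
The tolerance comparison amounts to a percentage deviation check, sketched below. The standard value 0.647 for β = 0.593 is taken from Table 3. The experimental value used here is hypothetical, constructed to sit 0.4 per cent above the standard value as the text describes, and the function names are illustrative.

```python
def percent_deviation(experimental, standard):
    """Deviation of an experimental coefficient from the standard value, in per cent."""
    return 100.0 * (experimental - standard) / standard


def within_tolerance(experimental, standard, tolerance_percent=0.5):
    """True when the deviation does not exceed the allowable tolerance."""
    return abs(percent_deviation(experimental, standard)) <= tolerance_percent


standard_K = 0.647                 # DIN value for beta = 0.593 (Table 3)
example_K = standard_K * 1.004     # hypothetical experimental value, 0.4 per cent high

print(f"deviation = {percent_deviation(example_K, standard_K):+.2f} per cent")
print("within ±0.5 per cent:", within_tolerance(example_K, standard_K))
```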


Counter Measure for Vibration and Noise 


The objectionable noise and vibration noted in the lower cavi- 
tation number regions in Fig. 5 and tabulated in Table 4 can be 
reduced or eliminated by the use of air-inhalation. In this ex- 
periment a quantity of air was allowed to enter the piping at a 
point 14 cm downstream from the orifice plate through a 6-mm- 
diam hole. 

Fig. 7 shows the effect of different air-inhalation coefficients on 
the discharge coefficients of various diameter-ratio orifice plates. 
The air-inhalation coefficient, q/Q, is the ratio of inhaled air 
volume at the downstream tap conditions to the flowing volume. 
Fig. 7 shows that the discharge coefficients for the lower diameter-ratio
orifice plates, namely, for β equal to 0.222 and β equal to
0.444, remain within the allowable tolerance of ±0.5 per cent with
an air-inhalation coefficient as great as 0.25, while the air-inhalation
coefficient for the higher diameter-ratio orifice plate, namely,
0.633, must not be greater than 0.08 to maintain the allowable
tolerance for the discharge coefficient.

Fig. 5 Relation between cavitation number K_d and discharge coefficient K

Fig. 6 Relation between Reynolds number R_D and discharge coefficient K

Fig. 8 shows the relationship between the minimum air-inhala- 
tion coefficient and cavitation number for various diameter ratio 
orifice plates to produce optimum noise and vibration suppression. 
It may be noted that the minimum air-inhalation coefficient required
to suppress the vibration and noise increases with a
decrease in cavitation number.


Conclusion 


The results obtained concerning the effects of cavitation on the 
discharge coefficients of standard sharp-edged orifice plates are 
summarized as follows: 


1 The discharge coefficient K is not influenced beyond the 
standard allowable tolerances to a minimum cavitation number 
of 0.2. 

2 The incipient cavitation number is approximately 2.5 and 
is independent of the orifice-diameter ratio in the range of the 
test orifices, the experimental velocity, and the static pressure. 

3 The method of air-inhalation may be employed to suppress
the vibration and noise due to cavitation without influencing the
orifice-discharge coefficient beyond the allowable tolerance of
±0.5 per cent.
DISCUSSION
James W. Ball³

3 Hydraulic Research Engineer, Head, Hydraulic Structures and
Equipment Section, Hydraulic Laboratory, Bureau of Reclamation,
Denver, Colo.

The material contained in the paper is very interesting and
establishes definitely that the basic laws for orifices are true.
Work of this nature signifies progress and increases the confidence
of those who use the results. The authors are to be commended
for their contribution.

Actually, an examination of basic relationships for orifices confirms
the results. The differential head between two pressure
taps, one upstream and the other downstream of an orifice in a
pipeline, is an indication of the amount of flow through the
orifice irrespective of the location of the taps. The change in hydraulic
grade between the two taps (differential head) results from
three sources: (1) head to cause the flow through the orifice, (2)
head loss due to boundary friction, and (3) head loss due to turbulent
eddies. The amount contributed by each of the three sources
will vary with the geometry of the jet issuing from the orifice and
the distance of the taps from the orifice. For fully developed
turbulence and a given orifice, the flow pattern geometry in the
pipe does not change noticeably, provided the downstream pipe is
kept under a back pressure which assures that the jet is surrounded
by water. The coefficient of discharge for a given orifice to pipe 
diameter ratio will be constant for any tap location as long as this 
condition exists. The location of the upstream tap is not too im- 
portant. However, this is not true of the downstream tap. A 
fixed location for the upstream tap will be considered for the fol- 
lowing discussion. As the pressure gradient downstream from an 
orifice is lowered and the cavitation envelope forms and extends 
sufficiently to alter the flow stream geometry, the coefficient based
on taps affected by this alteration will deviate from the constant
value. The smallest deviation will occur for taps least affected
by the change in the stream geometry. The stream geometry up- 
stream from the vena contracta remains essentially the same re- 
gardless of the downstream pressure; thus the discharge co- 
efficients based on measurements at downstream taps located 
at or upstream from the vena contracta will not be changed by the 
presence of cavitation downstream from the orifice. This ex- 
plains why the coefficient of discharge is essentially constant for 
the flange taps used in the tests discussed in the paper and why 
similar results shown in Fig. 9 were obtained in tests at cavitation 
numbers as low as 0.001 and heads up to 160 feet made in a test 
facility (Fig. 10) in the Bureau of Reclamation Hydraulic Labora- 
tory. This also explains why the coefficient of discharge decreases 
abruptly when the downstream tap is located any appreciable dis- 
tance downstream from the vena contracta as is the case for part 
of the data shown on Fig. 11. However, the coefficients in these 
cases remain substantially constant until the hydraulic grade in 
the “recovery” region moves past the tap location. 

As the cavitation envelope below an orifice lengthens due to de- 
creased back pressure and the recovery region moves past the 
downstream tap, the pressure at this tap is lowered, increasing the 
apparent differential head between the upstream and downstream 
taps while changing neither the pressure immediately downstream
from the orifice nor the discharge through the orifice.
This results in a decrease in coefficient as shown on the curves of 
Fig. 11. As the cavitation envelope extends and vapor pressure 
reaches the downstream tap, the coefficient again becomes con- 
stant and has a value equal to that for flange taps. 

There is a difference in numerical value of the coefficient de- 
pending on how far the downstream tap is located from the orifice. 
This is due mainly to the difference in friction and eddy losses to 
the various locations. The differences for taps located 0, 2.1, 15,
and 36 inches downstream from 2 3/8-, 1 3/4-, and 1 1/4-in. orifices in a
3-in. standard pipe are shown in the plots on Fig. 11. The differences
for the two locations farthest downstream become insignificant
for the smallest orifice where the velocities in the pipe
are very low and the friction and eddy loss differences become an
insignificant part of the total head differential. The shape of the
curves, cavitation number K versus coefficient of discharge C, 
in any case will be essentially the same for both locations in the 
range where the recovery region moves past the tap. Where 
the tap is located just downstream from the vena contracta, as for
the 2.1-in. location, Sta. (3), on the 2 3/8-in. orifice, the coefficient
will be slightly larger than for flange taps until vapor pressure
reaches the tap (Fig. 11).

A flow nozzle 1.333 inches in diameter (Fig. 12) was tested in 
the same setting as the three orifices. The data for the nozzle are 
plotted on both Figs. 9 and 12. The K_d versus C curve on Fig. 9
is quite different from that for orifices, indicating that cavitation 
does affect the efficiency of this particular shape. Also, for this 
particular nozzle, there seemed to be a slight change in the value of 
C with head, the cause of which has not been definitely determined 
but may be an influence introduced by the shape of the nozzle. 
The discharge coefficient of this nozzle is affected by cavitation 
irrespective of the downstream tap location. 
H_1 = Pressure head at upstream flange tap in feet.
H_2 = Pressure head at downstream flange tap in feet.
H_v = Vapor pressure of water in feet.
V_0 = Average velocity through orifice in feet per second.
A_0 = Area of orifice in square feet.
Q = Rate of flow in cubic feet per second.

Fig. 10 Test facilities for orifices and flow nozzle


The cavitation data given in the paper are very interesting, and
their publication has brought out some interesting facts. The
cavitation numbers for incipient cavitation obtained by ear for the
1 1/4-, 1 3/4-, and 2 3/8-in. orifices tested in the Bureau of Reclamation
Hydraulic Laboratory and based on the relationships given in the
paper were 1.3, 1.1, and 1.0, respectively. These values appear to
agree reasonably well with the paper; however, the trend is for
K_d determined by ear to increase with decrease in β rather than
increase with β as does line "L," which is defined as "where noise
and vibration from cavitation became quite apparent" (Fig. 6
of the paper).

In one series of tests made in the Bureau of Reclamation Hydraulic
Laboratory on the 1 3/4-in. orifice, the value of K for incipient
cavitation, using the relationships given in the paper, did
not agree with that given in the paper. Also, an examination of
the test results made on two 2.083-in. orifices, placed in tandem in
a 3-in. pipeline to serve as a pressure reducing system,⁴ indicated
poor agreement. In these instances it appeared that cavitation
occurred at much smaller values of K than shown in the paper.
Check tests on the 1 3/4-in. orifice showed the initial tests to be in
error. The check tests on this orifice agreed reasonably well with
the results given in the paper and it was theorized that a small air
leak into the system occurred downstream from the orifice, thus
preventing cavitation noise (crepitation) until a lower value of K
was reached.

Fig. 9 Variation of discharge coefficient C with cavitation number K

In the tests in which the two 2.083-in. orifices were used in 
tandem in a 3-in. pipeline (Figure 13), the criterion was that the 
noise level was acceptable. This point was therefore somewhat 
above that for incipient cavitation and agreement with the data 
in the paper could not be expected. 

The cavitation number given in the paper is not in a form
which could be readily used by designers to determine whether or
not there would be objectionable cavitation and vibration in an
orifice installation. In studies on valves in the Bureau of Reclamation
Hydraulic Laboratory⁵ another form of cavitation number
was found most useful for this purpose. The same relationship
can be applied to orifices in a pipeline. The data for three
orifices using this relationship

$$K = \frac{H_2 - H_v}{H_T - H_2}$$

and

$$C = \frac{Q}{A_0 \sqrt{2gH}}$$

are plotted on Fig. 11 where the symbols in both equations are
defined.
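
A short numerical sketch may help in reading these two quantities. It assumes the cavitation index takes the form K = (H_2 − H_v)/(H_T − H_2), with H_v the vapor pressure head relative to atmosphere (a negative number of feet), and C = Q/(A_0 √(2gH)). Taking the head H as the pressure differential between the upstream and downstream taps is an assumption, and the function names and sample values are hypothetical.

```python
import math


def cavitation_index(H_T, H2, Hv):
    """Cavitation index K from total upstream head, downstream head, and vapor head (ft)."""
    return (H2 - Hv) / (H_T - H2)


def coefficient_of_discharge(Q, A0, H, g=32.174):
    """C from flow rate Q (ft^3/s), orifice area A0 (ft^2), and differential head H (ft)."""
    return Q / (A0 * math.sqrt(2.0 * g * H))


# Hypothetical operating point: 160 ft total upstream head, 10 ft back pressure,
# vapor pressure head of about -33 ft relative to atmosphere.
print(cavitation_index(H_T=160.0, H2=10.0, Hv=-33.0))
print(coefficient_of_discharge(Q=1.0, A0=0.0167, H=150.0))
```

Raising the back pressure H_2 increases the numerator and reduces the denominator, so K rises quickly as the downstream pipe is pressurized.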


4 L. V. Wilson, "Hydraulic Studies of a Pressure Reducing System
for the Transformer Cooling Water, Grand Coulee Power Plant,"
Bureau of Reclamation Hydraulic Laboratory Report Hyd-308,
March, 1951.

5 James W. Ball, "Cavitation Characteristics of Gate Valves and
Globe Valves Used as Flow Regulators Under Heads Up to About
125 Feet," Trans. ASME, vol. 79, 1957, pp. 1275-1283.
K_i = Cavitation index where cavitation becomes audible.
H_0 = Pressure head one diameter upstream.
H_T = Total upstream head (pressure plus velocity).
H_v = Vapor pressure of water relative to atmosphere.
H_2 = Downstream pressure head at location noted below (feet of water).
C = Coefficient of discharge.
A_0 = Area of orifice in square feet.

Plotted symbols distinguish the following:
K and C based on heads one pipe diameter upstream and 11.7 pipe diameters downstream from orifice.
K and C based on heads one pipe diameter upstream and 4.9 pipe diameters downstream from orifice.
K and C based on heads one pipe diameter upstream and 0.7 pipe diameter downstream from orifice.
K and C based on heads one pipe diameter upstream from orifice and at downstream flange.
Cavitation not audible.
Cavitation audibility questionable.

Fig. 11 Variation of discharge coefficient C with cavitation number K for orifices 1 1/4-, 1 3/4-, and 2 3/8-in. diameter

Knowing the critical cavitation number K_i at which cavitation
is incipient for an orifice, it is possible to determine the back pressure
required to prevent cavitation for a given upstream head, or
to compute the allowable upstream head for a given back pressure.
It is believed that the variation of K with β′, where β′ is
the ratio of the area of the jet at the vena contracta to that of the
pipe, should be somewhat like that shown in Fig. 14. This relationship
was obtained by assuming general cavitation just beginning
in the pipe immediately downstream and a loss to the
recovery pressure downstream equal to the Borda Loss expressed
by H_L = (V_0 − V_2)²/2g, where V_0 = velocity of the contracted
jet⁶ and V_2 = velocity in the pipe downstream.
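
The two design calculations mentioned above follow from rearranging the cavitation index. The sketch below assumes the index has the form K = (H_2 − H_v)/(H_T − H_2), so that the limiting back pressure is H_2 = (K_i H_T + H_v)/(1 + K_i) and the limiting upstream head is H_T = H_2 + (H_2 − H_v)/K_i. The function names and the sample numbers, including the value of K_i, are hypothetical.

```python
def required_back_pressure(K_i, H_T, Hv):
    """Smallest downstream head H2 (ft) that keeps the index at or above K_i."""
    return (K_i * H_T + Hv) / (1.0 + K_i)


def allowable_upstream_head(K_i, H2, Hv):
    """Largest total upstream head H_T (ft) that keeps the index at or above K_i."""
    return H2 + (H2 - Hv) / K_i


K_i = 1.0      # hypothetical critical index for the orifice
Hv = -33.0     # vapor pressure head relative to atmosphere, ft (approximate)

H2_min = required_back_pressure(K_i, H_T=100.0, Hv=Hv)
print("back pressure needed for 100 ft upstream head:", H2_min, "ft")
print("upstream head allowed at that back pressure:",
      allowable_upstream_head(K_i, H2_min, Hv), "ft")
```

The two functions are inverses of one another, which gives a convenient self-check when either is used in design.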

Values of K_i determined by ear for the three orifices and based
on the relationship in Fig. 11 for taps located at Sta. 4 and Sta. 5
(4.9 and 11.7 pipe diameters downstream of the orifices) are also
plotted in Fig. 14. The curve formed by these values is much
higher than that based on the occurrence of vapor pressure immediately
downstream from the orifices and the Borda Loss.
This signifies that cavitation occurs before vapor pressure is
registered at flange or vena contracta taps.

Several tests on the three orifices were made for conditions
where vapor pressure at the downstream flange tap was believed
to be incipient. The values of K for this condition, based on
downstream taps at Stations 4 and 5, were plotted for comparison
with the values obtained using the Borda Loss relationship. The
K values for the orifices, based on heads at Stations 4 and 5 and
obtained from plots of the pressure data, were in good agreement,
Fig. 14. The agreement is excellent considering that the information
was taken from tests not made for that specific purpose, that
it was questionable that vapor pressure was incipient at the
downstream flange taps, and that it was questionable that the
taps were located or could be located to indicate the conditions
assumed for the Borda Loss relationship.

6 Contraction coefficient obtained from page 35, "Engineering Hydraulics,"
by Hunter Rouse.

Fig. 12 Variation of discharge coefficient C with cavitation number K for flow nozzle 1.333 inches in diameter

Fig. 14 Critical cavitation number K_i for orifices and flow nozzle

The values of K_i for the three different orifices were obtained
by ear. The audibility of cavitation was noted for each test run.
Three conditions were observed: (1) cavitation definitely
audible, (2) cavitation definitely not audible, and (3) cavitation
audibility questionable. The third condition gave the most difficulty
but was very useful in selecting K_i for each orifice.

It is questionable that cavitation actually exists in all the
"manner of cavitation occurrence" definitions given in Table 2 of
the paper. It is believed that the bubbles described for a cavitation
number of 2.5 are those formed by air being released from
solution as the flow passes into the low pressure zone immediately 
downstream from the orifice rather than vapor cavities. Bubbles 
formed by air coming out of solution and then moving in the eddy 
zone downstream, as described in Table 2 for a cavitation number 
of 1.0, are not necessarily vapor cavities and cannot be considered 
definite proof that cavitation is present. Air coming out of solu- 
tion in sufficient quantity may influence the point of incipient 
cavitation and muffle the crepitation. Similar tests in a water 
tunnel, using deaerated water, would be of particular interest. 

It is of interest at this time to point out that because of rela- 
tively high head losses, orifices can be used in tandem for energy 
dissipation, Fig. 13. However, because of the intense noise and 
vibration accompanying cavitation, it is important to prevent 
cavitation by providing sufficient back pressure. Also, it is im- 
perative to consider the fact that wide fluctuations in pressure 
occur in the eddy zone immediately downstream from the orifice 
at all times when high-velocity flows are being handled. Because 
of the critical factors involved it is expedient and wise to make 
hydraulic investigations before designs employing this principle 
are used. More interesting facts about orifices will no doubt be 
forthcoming when studies are made of the transient pressures 
downstream. 

A point of interest concerns measurement of head at pressure 
taps located downstream from orifices. Wide fluctuations in 
pressure make it imperative to obtain a sufficient number of 
readings to obtain average values or use ample damping in the 
pressure recording units to obtain average values. The con- 
sistency of data will depend on the care exercised in recording the 
head. This factor no doubt is the reason for the variable spread 
in points on the curves of the paper and this discussion. 

The arrangement shown schematically in Fig. 13 has been used
to reduce the head from 163 feet to 38 feet for a transformer cooling
water system.⁴ An installation using a needle valve as a
variable orifice has been employed successfully in reducing a head
of 630 feet to 250 feet to supply and make use of an existing
power plant designed for the lower head.⁷ In both of these cases
it appears that the major portion of the energy in a high-velocity 
jet discharging into a sudden enlargement is dissipated in the first 
5 to 6 diameters length of the enlarged section. This condition 
of course is not true if a general zone of cavitation is permitted to 
occur immediately downstream from the point of entry. Ex- 
tremely high heads may also alter this observed characteristic. 

Some work not yet published has been performed by the 
Bureau of Reclamation Hydraulic Laboratory on the back pres- 
sure necessary to prevent destructive cavitation for valves dis- 
charging into sudden enlargements of various sizes. Briefly, the 
tests have shown that cavitation numbers, based on the relation- 
ship given in Fig. 11, may be as low as 0.20 without causing pitting 
of the walls of the sudden enlargement when a diameter ratio, 
valve to pipe, of 1.75 or more is used. Also, the admission of a 
very small amount of air near the downstream end of the valve 
will quiet crepitation immensely. 

7 K. Stierlin, "Pressure Reducing Plant for a Water Power Station,"
Escher Wyss News, vol. 30, no. 1, 1957.

The standard hydraulic symbols C and K, for discharge coefficient
and cavitation number, are used in this discussion.
The use of K for both, as in the paper, is somewhat confusing.


Authors’ Closure 
(Written by F. Numachi) 


The authors wish to express their appreciation of the thoughtful
discussion given by Mr. James W. Ball. The data given
in his discussion support the main tenor of their paper. They are
especially grateful for the experimental proof in his discussion
that the C-value is not affected by cavitation, not only down to a
cavitation number of 0.2 but also down to 0.001.

He clarified that the K_d versus C curve of the nozzle is affected by
cavitation, and the results are obviously useful and interesting.

In the authors' paper, flow measurement by means of the standard
pipe orifice is the center of interest, while Mr. Ball gives his
opinion on cases where the positions of the taps are changed and on
the tandem setting of orifices, free from the standardization of the
pipe orifice. These will be very helpful for pressure reducers, and so forth.

That the C-value of a pipe orifice varies with the stations of the taps
results evidently from the pressure variation on the pipe wall
downstream,⁸ which is supposed to be one of the factors that
led to determining the stations of both taps of the present standard
orifice.

In practice the tap stations differ among the ASME, DIN (German), and
JES (Japanese) standards, as follows:


DIN and JES   Corner taps (directly at the orifice plate)
ASME          (a) Flange taps (one inch upstream and one inch
                  downstream from the orifice plate)
              (b) Pipe taps (2.5 D upstream and 8 D downstream)


The difference between the C-values of the ASME standard⁹ (flange
taps) and those of the corner taps exceeds the allowable tolerance
of the ASME standard (±0.5 per cent) when the Reynolds
number R_D is smaller than the values given below, depending upon
the value of β.


β            0.224   0.448   0.499   0.593   0.633
R_D X 10~*     4      10      15      20      25


The relation between the cavitation coefficient K which Mr. 
Ball advocates and the authors’ cavitation coefficient Ky is ex- 
pressed as follows: 


K, = KB + aun) 


Since there is no substantial difference between K and K,, 
the authors cannot definitely decide which is more suitable for de- 
signing. But usually, Hp, i.e., pressure head one diameter up- 
stream from the orifice plate, is not measured in ASME standard. 
Cavitation number K containing such value as Hy (Ref. Hp = 
Ho + (87Q/Ao)?/2g) is considered as inconvenient for designing. 

Mr. Ball says that he thinks the phenomena in K, = 2.5 to 1.0 
are not cavitation but air releasing. According to the information 


8 J. L. Hodgson, “The Measurement of the Flow of Gases and
Liquids by Means of Orifices, Nozzles, and Tubes,” Proceedings,
World Engineering Congress, Tokyo, 1929, vol. IV, p. 111.

F. C. Johansen, “Flow Through Pipe Orifices at Low Reynolds
Numbers,” Aeronautical Research Committee, Reports and Memoranda,
no. 1252, 1929.

9 R. F. Stearns, R. R. Johnson, R. M. Jackson, and C. A. Larson,
“Flow Measurement With Orifice Meters,” D. Van Nostrand Company
(Canada), Ltd., 1951, p. 222.
recognized recently,¹⁰ cavitation is regarded as impossible in the
absence of nuclei in the form of microscopic air bubbles, even in
the presence of dissolved air; that is, no cavitation is found in
the pure vaporization of water. This differs from Mr. Ball’s
view.

Mr. Ball showed an interest in the audibility of cavitation
incipience. The authors adopted three methods for detecting cavitation
incipience: (i) by ear, (ii) with the naked eye, and (iii) by
receiving the ultrasonic waves emitted by cavitation.¹¹ The

10 E. N. Harvey, Wm. D. McElroy, and A. H. Whiteley, “On
Cavity Formation in Water,” Journal of Applied Physics, vol. 18,
1947, p. 162.

R. T. Knapp, “Cavitation and Nuclei,” Trans. ASME, vol. 80,
1958, p. 1315.
method (i) is sometimes disturbed by the noise of machinery. The
values of K;, for the pipe orifice obtained by three methods agreed 
with one another. The results concerning method (iii) will be
made public in the near future. In the range K, = 2.5 to 1.0,
ultrasonic waves were received. An ultrasonic shock wave occurs with the
collapse of cavitation bubbles, but perhaps not with the mere release
of air from water.


11 F. Numachi, “Ultraschallwelle am Tragflügelprofil bei Hohlsog /
Teil 1,” Forsch. Ing.-Wes., vol. 22, 1956, pp. 77-84.

F. Numachi, “Ultraschallwelle am Tragflügelprofil bei Hohlsog /
Teil 2,” Forsch. Ing.-Wes., vol. 24, 1958, pp. 73-78.

F. Numachi, “Transitional Phenomena in Ultrasonic Shock
Waves Emitted by Cavitation on Hydrofoils,” JOURNAL OF BASIC
ENGINEERING, Trans. ASME, Series D, vol. 81, 1959, pp. 153-166.
Measurement of Oil-Film Thickness
Between Disks by Electrical Conductivity


S. I. EL-SISI
Mechanical Engineering Department,
Assuit University, Assuit, Egypt

G. S. A. SHAWKI
Mechanical Engineering Department,
Cairo University, Cairo, Egypt


A special apparatus is designed to measure the oil-film thickness between two
stationary disks (simulating meshing gear teeth) by applying a-c or d-c potential
across the film and measuring its resistance for a given gap between the disks.

Experiments show that the treatment of oil with additive (sodium-petroleum sulphonate
in this case) may well lead to a considerable increase in its electrical conductivity, and to
a substantial improvement in the independence of the oil-film resistance of the electrical
history and temperature of the oil. A reliable relationship could eventually be obtained
between the oil-film resistance and relevant thickness.

Test results are employed in investigating the behavior of the oil film in a rotating-disk
testing machine developed by the authors for studying the performance of meshing
gear teeth with adequate supply of lubricant.


Introduction 


THE ACCURATE MEASUREMENT of thin oil films has, for some time,
been a handicap to quantitative investigations on the performance
of lubricating oil films formed between gear teeth. The
method of electrical conductivity was tried by Cameron [1]¹ and
by Hughes [3], who mentioned in his report that the interpretation of
electrical resistance in terms of film thickness is not possible, at
least with ordinary lubricating oils. This may be attributed to
the high insulating properties of these oils, and to the dependence
of the breakdown voltage on the previous electrical history of
the film.

It is with this picture in mind that the present research has 
been planned and carried out in an attempt to obtain an additive 
which would improve the electrical conductivity of oil, and would 
impart to it certain basic qualities which would render it suitable 
for use in this method of measurement. Only final results of the 
numerous tests made are herein presented. 

The apparatus described in this paper is virtually a length- 
measuring device in which oil films may be formed between two 
stationary disks whose distance apart can be finely adjusted and 
precisely measured. The disks are set at a given distance apart 
and the electrical current flowing through the film is measured 
for specified potential applied across it. This would lead to some 
relation between the oil-film resistance and relevant thickness at 
the thinnest section. Such relation would be very useful for in- 
terpreting current measurements for the rotating disk testing 
machine [2]. 


Description of Apparatus 


The general arrangement of the apparatus is shown in Fig. 1 
and a front view in Fig. 2. 

Two identical disks 1 and 2 are mounted on two horizontal
spindles 3 and 4 with their axes in the same vertical plane. Both
disks are made of case-hardened mild steel, and measure 6 in.
in diameter and 2 in. in width. They are electrically insulated
from their respective spindles by means of Perspex bushes.

The top disk spindle 4 is fixed to a steel frame 5 supported by 
three legs 6 resting on a surface plate. 

The bottom disk spindle 3 is fixed to a steel framework 7 fitted, 

1 Numbers in brackets designate References at end of paper.

Contributed by the Lubrication Division and presented at the
Annual Meeting, New York, N. Y., November 30-December 5, 1958,
of THE AMERICAN SOCIETY OF MECHANICAL ENGINEERS.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those
of the Society. Manuscript received at ASME Headquarters, July 11,
1958. Paper No. 58-A-253.
at its extreme end, with a micrometer 8 resting against a stand 9.
It is also fixed to two L-shaped steel pieces 10 pivoted on the main 
frame 5 through two balls symmetrically disposed with respect to 
the bottom disk. These balls are fitted in the L-shaped pieces 
and rest on two vertical flat-topped set screws 11 screwed into the 
main frame. 

It is evident, therefore, that the whole of the bottom disk as- 
sembly is virtually a lever with its fulcrum at the two balls and its 
free end at the micrometer. The magnitude of disk displacement 
rendered by leverage has been found, with the help of Johansson 
slip gages inserted between the disks, to be equal to 45.81. 
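
As a rough illustration of the lever arrangement, the sketch below converts a micrometer travel into a disk gap. It assumes, as one reading of the figure quoted above, that 45.81 is the ratio of micrometer travel to disk displacement; the function and constant names are hypothetical and do not come from the paper.

```python
# Assumption: 45.81 is read as the lever ratio, i.e. micrometer travel
# divided by the resulting displacement of the bottom disk.
LEVER_RATIO = 45.81


def disk_gap_in(micrometer_travel_in: float, ratio: float = LEVER_RATIO) -> float:
    """Return the disk displacement (in.) for a given micrometer travel (in.)."""
    if micrometer_travel_in < 0:
        raise ValueError("micrometer travel must be non-negative")
    return micrometer_travel_in / ratio


if __name__ == "__main__":
    # Travel needed to open a 0.010-in. gap, the largest gap used in the tests.
    travel = 0.010 * LEVER_RATIO
    print(f"{travel:.4f} in. of travel -> {disk_gap_in(travel):.4f} in. gap")
```

Under this reading, the 0.010-in. gap quoted in the test procedure would correspond to roughly 0.46 in. of micrometer travel.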

The bottom disk assembly is made, through two helical ex- 
tension springs 12, to rest against two horizontal pointed set 
screws 13 fitted in two steel brackets 14 which are, in turn, fixed 
to two of the apparatus legs. 

The components are so designed that, when the brackets are at
their highest position in contact with the underside of the
top plate of the main frame, the axes of set screws 13, the points of
contact of the two balls, and the point of contact of the micrometer
all lie in one horizontal plane passing through the line of contact of
the two disks. This arrangement results in a negligible horizontal
movement of the bottom disk (with respect to its vertical movement)
within the useful 1/2-in. range of the micrometer.

The points of contact of the fulcrum balls and the points of
contact of set screws 13 are made to lie on one horizontal axis 
about which the whole of the bottom disk assembly may turn. 

A reservoir made of perspex is cemented to the bottom disk to 
provide an oil bath for the gap between the disks. The bath is 
fitted with a small heating element and a drain cock. 

Both direct and alternating potentials can be applied across the 
disks. The d-c circuit, Fig. 3, consists of a 2-volt accumulator 
connected to the terminals of a potentiometer by means of which 
the voltage across the gap may be adjusted. The current is 
measured by a microammeter, and the voltage by a vacuum-tube 
voltmeter. 

The a-c circuit comprises a bridge, one arm of which is formed 
by the oil film, while the other arm has a sensitive nonreactive 
resistance box, the balance point being detected by an oscilloscope. 
The potential applied across the film is measured by a vacuum- 
tube voltmeter, and has a maximum value of 2 volts. 
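
The d-c measurement reduces to Ohm's law applied to the voltmeter and microammeter readings. A minimal sketch follows; the function name and the sample readings are illustrative assumptions, not values taken from the tests.

```python
def film_resistance_ohm(voltage_v: float, current_ua: float) -> float:
    """Film resistance (ohm) from the voltage across the gap (V) and the
    microammeter reading (microamperes)."""
    if current_ua <= 0:
        raise ValueError("current must be positive for a finite resistance")
    return voltage_v / (current_ua * 1e-6)


if __name__ == "__main__":
    # Illustrative readings only: 0.5 V across the gap, 2.5 microamperes.
    print(f"{film_resistance_ohm(0.5, 2.5):.3g} ohm")
```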

The table carrying the apparatus is mounted on rubber pads to 
isolate it from vibrations that may be transmitted from the 
floor, and measuring instruments are placed on a separate table. 


Test Procedure 
The top disk is first removed to have the surfaces of the disks 
Fig. 1 General arrangement of the apparatus

(1) Bottom disk
(2) Top disk
(3) Bottom spindle
(4) Top spindle
(5) Main frame
(6) Supporting legs
(7) Framework
(8) Micrometer
(9) Stand
(10) L-shaped pieces
(11) Set screws
(12) Helical extension spring
(13) Set screws
(14) Brackets
(15) Plane surface gage block


Fig. 2 Front view of the apparatus

Fig. 3 Electric circuits used in the tests (panel label: A.C. circuit)


thoroughly cleaned with acetone, and then fixed again in its 
position. 

The axes of the disks are brought into the same vertical plane 
with the help of a plane surface gage block 15, Fig. 1, and a light 
source, the set screws 13 being adjusted to obtain line contact be- 
tween the bottom disk and the gage block. 

The next step is to adjust the axes of the disks to the parallel 
position. The gage block is therefore carefully removed, and a 
parallel beam of light is projected from a collimator toward the 
disks’ contact from the micrometer side, the apparatus being en- 
closed in a dark space for accuracy of observation. The vertical 
set screws 11 supporting the fulcrum balls are adjusted so that the 
width of the band of light seen from the other end of the collimator 
becomes as uniform as possible. The bottom disk is thereafter
moved very slowly toward the top disk by means of the micrometer.
Should the axes of the disks be parallel, the light would, as the
disks touch, disappear over the whole width of the band at the
same time.

It should be noted that the micrometer reading at which the
light disappears is not taken as the zero reading corresponding to
the touching of the disks, since contact is instead detected by the
completion of the electric circuit. It has been found that the bottom
disk should be moved a further distance of about 0.000005 in.
toward the top disk before the electric circuit is completed.
This latter distance lies within the range of surface
roughness of the disks.

After adjusting the axes of the disks parallel and in the same 
vertical plane, oil is poured into the reservoir, the region of closest 
approach of the disks being submerged in the oil. The extent of 
the film may be varied by changing the oil level in the bath, and 
the oil temperature by adjusting the current flow in the heating 
element. 

Either a-c or d-c potential is applied across the gap between the
disks, the respective circuit being used.

Tests were started with the largest possible gap between the
disks (0.010 in.) and then the gap was progressively reduced.
The reverse procedure was avoided as it gave unreliable results;
the apparent reason is that, with this procedure, oil may
not completely fill the gap.


Behavior of Untreated Oil 


“Castrolite 20 A” oil is first used without additive, its behavior 
being obtained for both a-c and d-c potential applied across the 
film. 

Fig. 4 shows that, for a given gap, the oil-film resistance drops 
sharply with increase in applied potential. The oil is also found 
to become temporarily more conductive as the voltage increases; 
this is indicated by the test points obtained while the voltage was 
reduced to zero and then brought back to its original value. 

The variation of oil-film resistance with the gap between the 
disks, Fig. 5, is obtained for a constant a-c potential of 0.05 volt 
Fig. 4 Oil-film resistance versus applied potential for untreated oil.
(Each curve is obtained for a new sample of oil); oil temperature: 56 F.
(Legend entry recovered: a-c potential.)
and for two values of the oil temperature; namely, 53 and 160 F.
It can be seen that the oil-film resistance varies considerably with 
temperature. Moreover, the curves obtained are not very reliable 
since the applied potential may, for relatively small gaps, cause 
breakdown in the oil resistance and hence a change in the electri- 
cal behavior of the oil. 

It is worth mentioning that Cameron [1], while experimenting
on a disk-testing machine designed by Merritt [4],
obtained a relationship between oil-film resistance and applied
d-c potential quite similar to that obtained in the present tests,
and that Hughes [3] has noted the considerable effect of
temperature on oil-film resistance.

It may be concluded, therefore, that the measurement of film 
thickness using untreated oil would lead to unreliable results. 
This is due to the fact that the oil-film resistance, for specific 
operating conditions, depends on its electrical history, that the 
rate of change of resistance with film thickness (if reliable) is very 
high, and that the oil-film resistance is seriously affected by the 
oil temperature. 


Behavior of Treated Oil 


Castrolite 20 A oil is used with 1, 2, and 4 per cent sodium 
petroleum sulphonate as additive. 

The resistance of treated oil is found to remain constant over 
a certain range of the applied potential, Fig. 6. The value of the 
breakdown voltage is shown to be nearly the same for the various 
values of film thickness investigated with either a-c or d-c po- 
tential applied across the gap between the disks. 

It can be seen readily that, for the same film thickness and
applied potential, the electrical resistance of treated oil is much
less than that of the untreated oil, i.e., it becomes more conductive.
Moreover, the temperature seems to have but a negligible
effect on the resistance of treated oil, Fig. 7.

Fig. 5 Variation of oil-film resistance with film thickness and oil
temperature for untreated oil. (The voltage is brought to zero each time the
film thickness is reduced.)

Fig. 6 Resistance-voltage characteristics of Castrolite 20 A oil with 1
per cent sodium petroleum sulphonate; oil temperature: 55 F

Fig. 7 Variation of film resistance with film thickness for Castrolite 20 A
oil with 1, 2, and 4 per cent sodium petroleum sulphonate.
(a) a-c potential. Maximum applied potential: 0.5 volt for 1 per cent
additive; 0.75 volt for 2 per cent and 4 per cent additive.
(b) d-c potential. Maximum applied potential: 0.75 volt for 1 per cent
additive; 1.25 volt for 2 per cent and 4 per cent additive.
(Curves for 1, 2, and 4 per cent additive, each at 50 F and 160 F;
axis label recovered: film thickness, inches.)

Fig. 8 Extended film resistance-thickness characteristic

It also has been found that, for given film thickness, the re- 
sistive behavior of treated oil is almost unaffected by the removal 
and reapplication of potential, even if it exceeds the breakdown 
value. For the same gap and applied potential, oil-resistance 
values under d-c potential are noted to be, in general, lower 
than those obtained with a-c potential. 

The increase in strength of the additive in the mixture (up to 
4 per cent) is found to lead to a related increase in the value of the 
breakdown voltage, i.e., to an extension in the range of ohmic re- 
sistance behavior of the oil, and a reduction in the rate of change 
of oil resistance with film thickness, Fig. 7. 

In the course of tests run on the rotating-disk testing machine
[2], it was found necessary to extend the film resistance-thickness
characteristic to include film-thickness values up to 0.008 in.
Such a calibration curve is shown in Fig. 8.
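
One way such a calibration curve might be used is sketched below: a measured resistance is converted to a film thickness by interpolating between calibration points. The calibration pairs are hypothetical placeholders, not readings from Fig. 8, and the choice of interpolating linearly in log-log space is an assumption made for illustration, as is the requirement that resistance rise monotonically with thickness.

```python
import bisect
import math

# Hypothetical calibration points (film thickness in in., resistance in ohm).
# Placeholders standing in for values read off a calibration curve.
CALIBRATION = [
    (0.0005, 1.0e4),
    (0.001, 3.0e4),
    (0.002, 9.0e4),
    (0.004, 2.7e5),
    (0.008, 8.1e5),
]


def thickness_from_resistance(resistance_ohm, table=CALIBRATION):
    """Interpolate film thickness (in.) from resistance, linearly in log-log space."""
    pts = sorted(table, key=lambda p: p[1])  # sort by resistance
    log_r = [math.log(r) for _, r in pts]
    log_h = [math.log(h) for h, _ in pts]
    x = math.log(resistance_ohm)
    if not log_r[0] <= x <= log_r[-1]:
        raise ValueError("resistance outside the calibrated range")
    # Index of the upper bracketing point, clamped to a valid segment.
    i = min(max(bisect.bisect_left(log_r, x), 1), len(pts) - 1)
    frac = (x - log_r[i - 1]) / (log_r[i] - log_r[i - 1])
    return math.exp(log_h[i - 1] + frac * (log_h[i] - log_h[i - 1]))


if __name__ == "__main__":
    print(thickness_from_resistance(5.0e4))
```

Values outside the calibrated range are rejected rather than extrapolated, since the text gives no basis for extending the curve.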

The effect of film extent on both sides of the line of nearest ap- 
proach was investigated in the following manner: 

The bath was first filled with oil to a level much higher than 
that of the intermediate plane between the disks, the current 
passing through the film being measured for given gap and applied 
potential. The oil was thereafter drained till the bath became 
empty. The current then flowing in the short oil film suspended 
between the disks was found to remain exactly the same as that 
measured before. This would imply that for such thin film the 
current flows in the region closest to the line of nearest approach. 

An alternative additive, namely, naphthalene tetrachloride in 
the strength of 1 per cent, was investigated; however, no effect 
on the electrical behavior of the oil could be detected. On in- 
creasing the strength to 1.5 per cent, the additive was found to 
form a layer of precipitate on the disks, and no reliable results 
could be obtained. 

The results of tests conducted on the stationary-disk apparatus 
described previously, were applied to the running-disk machine [2], 
the lubricating oil, Castrolite 20 A, being admixed with 4 per cent 
sodium-petroleum sulphonate, a-c or d-c potential (not exceeding 
0.75 and 1.25 volts, respectively) being applied across the gap 
between specimen disks. Fortunately, no difficulty due to pre- 
vious electrical history of the oil was encountered in the running 
machine on account of the continuous replenishment of oil therein. 

It should be noted that the curves relating film thickness to film 
resistance are obtained for idle films under no pressure, films 
which do not exactly conform to conditions prevailing in the ro- 
tating-disk machine [2]. Test results obtained, however, on this 
latter machine show similar resistive behavior of the oil film as 
that obtained for the stationary disk apparatus. 


Conclusion 


The measurement of oil-film thickness by the electrical-conductivity
method would lead to unreliable results should straight
mineral oil be used.

The treatment of oil with certain additives is found, however, 
to impart to the oil fundamental favorable features which render 
it quite suitable for use with this method of measurement. 

Admixing Castrolite 20 A oil with sodium-petroleum sulphonate 
(in the strength of up to 4 per cent) greatly increases its electrical 
conductivity and extends its range of ohmic-resistance behavior 
for the same gap and applied potential. The additive seems to
relieve the oil almost completely of its dependence on electrical
history and temperature. The resistive behavior of treated
oil, for a given film thickness, is found to be almost unaffected by
the removal and reapplication of potential, even if it exceeds the
breakdown value. The additive also makes the rate of change of
resistance with film thickness more gradual for the same
applied potential.


16 / MARCH 1960 


Eventually, a reliable relationship between the oil-film re- 
sistance and the oil-film thickness would emerge and, conse- 
quently, the method of electrical conductivity could be applied 
successfully to the measurement of oil-film thickness between 
gear teeth. 
DISCUSSION
C. M. Allen²


The authors are to be complimented on their unique approach 
to solving the age-old problem of measuring lubricant film thick- 
ness of a static and dynamic sliding-rolling system. The authors 
not only confirm the results of earlier workers but go beyond them 
in the business of electrical inference of lubricant film thicknesses. 

The data presented in Figs. 7(a) and 7(b) are convincing proof that
the effects of temperature between 50 and 160 F are negligible.
This appears to be true for variable concentrations of additive,
at least between 2 and 4 per cent.

Of interest also, are the comparative results using the a-c and 
d-c electrical potentials. Invariably, the indicated resistance of 
a given oil film is greater using a-c than for d-c. However, at 
thin films of the order below 200 microinches the two begin to 
coincide. The discusser conducted a considerable number of
minimum oil-film-thickness experiments on full cylindrical sleeve
bearings a number of years ago.³ The a-c method of determining
minimum lubricant film thickness was selected because it was felt
that the d-c methods might not provide as realistic data. The
thought was that as the electrical impulse passed through or
punctured the oil film the d-c potential would have a tendency to keep
the film punctured as the film thickness increased slightly. In
contrast, the a-c method alternately punctured the film but
permitted the puncture to heal between cycles. If this is true, one
would expect to have higher indicated resistance with a-c than
with d-c. It should be added that attempts to make the results
precisely quantitative were not particularly fruitful. When
attempts were made and published,³ the results were politely
criticized, as they should have been, because no data were available at
that time concerning the effects of shear, pressure, and temperature
of the film. Later on, a new approach to the problem was
made and was moderately successful.⁴ The oils used were
petroleum base and no additives were used.

The present authors are urged to provide an enlarged plot of 
oil-film resistance versus film thickness in the range of zero to 200 
microinches. It is in this region that many rolling and sliding- 
rolling machine elements operate; e.g., bearings and gears. This 
regime is of primary importance, since this is the region ap- 
proaching nonhydrodynamic lubrication, say at film thicknesses 
below 50 microinches. It would not be surprising if the
electrical-conductivity methods employed by the authors ran into
difficulty in this zone because many localized asperities will
provide an increased number of electrical paths, thereby providing
a multitude of instantaneous zero-resistance points. The cathode-ray
oscilloscope would be a better detector than a voltmeter since
the voltmeter will only average the indicated resistance or
potential.


2 Division Consultant, Battelle Memorial Institute, Columbus,
Ohio.

3 C. M. Allen, “The Dielectric Strength of Oil Films in Plain
Bearings,” Mechanical Wear, American Society for Metals, 1950,
pp. 181-191.

4 R. W. Dayton, C. M. Allen, et al., “A Fundamental Study of
Sleeve Bearing Behavior,” Office of Naval Research, Contract N5
ori-111, Battelle Memorial Institute Report, May 16, 1952.

The recent results presented by Rogers, Siripongse, and
Cameron⁵ in the August 1 issue of Engineering also appear to offer
some new data and hope for electrical detection of oil-film
thickness. Within the range of their experimental data, the effects of
shear, pressure, and apparently temperature do not appear to 
hamper interpretations of their d-c discharge potential on film 
thickness. However, no data are presented for film thicknesses 
below 150 microinches. 


M. D. Hersey⁶


The authors are to be congratulated on establishing a relation
between film thickness and electrical resistance, capable of leading
to the useful results reported in their companion paper.⁷ In
effect, they have removed the difficulty previously associated
with the application of Ohm’s law. It would therefore be of
interest to compare their findings with calculation. An approximation
may be offered by treating the circumference of each disk as
a parabola near the line of centers, and assuming that all the
elements of current flow are parallel to that line. The total
resistance R may then be expressed by integration in terms of the
resistivity of the oil ρ, the film thickness h_0 at the point of nearest
approach, disk diameter D, axial length L of the film at right
angles to the motion, and its breadth B, or extent at right angles
to the line of centers. Thus


\[ R = \frac{\rho}{L A} \left( \frac{h_0}{2D} \right)^{1/2} \qquad (1) \]


where A denotes the angle whose tangent is B/(2Dh_0)^{1/2}. As B
approaches zero, Equation (1) approaches the limiting form


\[ R = \rho h_0 / BL \qquad (2) \]


This agrees with the known expression for the resistance of a
rectangular block of height h_0 and cross section BL.

It would be of value to have numerical data on the extent B of 
the film remaining in suspension after draining the bath. We 
should expect that B will depend on the surface tension and vis- 
cosity of the oil and the motion of the disks. A comparison of 
Equation (1) with Fig. 7 indicates that B decreases with increas- 
ing ho. Equation (1) seems to confirm the fact that the resist- 
ance is but slightly affected by the film extent when B is suf- 
ficiently great, but the formulas presuppose a knowledge of B. 
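
The behavior described here can be checked numerically. The sketch below is an illustration under the discusser's stated assumptions (parabolic profiles, current elements parallel to the line of centers), for which the gap is h_0 + 2x²/D and integration of the parallel conductances gives R = (ρ/LA)(h_0/2D)^{1/2}; this closed form is a reconstruction from those assumptions rather than a quotation of the printed equation. The resistivity and film extents are placeholder values.

```python
import math


def resistance_closed_form(rho, h0, D, L, B):
    """Film resistance for parabolic profiles, gap h(x) = h0 + 2*x**2/D."""
    # A is the angle whose tangent is B / (2*D*h0)**0.5.
    A = math.atan(B / math.sqrt(2.0 * D * h0))
    return rho * math.sqrt(h0 / (2.0 * D)) / (L * A)


def resistance_numeric(rho, h0, D, L, B, n=20000):
    """Same quantity by midpoint-rule summation of the parallel conductances."""
    dx = B / n
    conductance = 0.0
    for i in range(n):
        x = -B / 2.0 + (i + 0.5) * dx
        conductance += L * dx / (rho * (h0 + 2.0 * x * x / D))
    return 1.0 / conductance


if __name__ == "__main__":
    # Disk size as in the apparatus (6 in. diameter, 2 in. wide);
    # rho (ohm-in.) and the extents B (in.) are placeholders.
    rho, h0, D, L = 1.0e8, 0.001, 6.0, 2.0
    for B in (0.01, 0.5, 2.0):
        print(B,
              resistance_closed_form(rho, h0, D, L, B),
              resistance_numeric(rho, h0, D, L, B),
              rho * h0 / (B * L))  # rectangular-block limit, valid for small B
```

Comparing the three printed columns for increasing B illustrates the point made above: the block formula holds only for small B, while for large B the resistance becomes nearly independent of the film extent.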


Ragnar Holm⁸


A condition for the reliability of the method is that the lubri- 
cant has a resistivity p that does not depend on the thickness of 
the film. A judgment on this condition has to be based on 
calculations of the following type. 

Let us assume ideally smooth rings with parallel axes; thickness
b; radius r; shortest gap h. It is readily shown that the

5 P. R. Rogers, C. Siripongse, A. Cameron, “Thin Film Lubrication:
1-Discharge Through Thin Oil Films,” Engineering, vol. 186, 1958,
pp. 146-147.

6 Visiting Professor of Engineering, Brown University, Providence,

7 “Performance Characteristics of Lubricating Oil Film Between
Rotating Disks,” published in this issue, pp. 19-28.

8 Stackpole Carbon Company, St. Marys, Pa.
gap at a point z (see Fig. 9) is approximately z²/r + h. Thus
the total resistance across the film is approximately


\[ R \approx \frac{\rho}{\pi b} \left( \frac{h}{r} \right)^{1/2} \]


Fig. 10 shows that the calculated R plotted against h is represented
by a curve of a different type from that measured. I would like
to hear the authors’ comments concerning the discrepancy
between theory and measurement.
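
For reference, the calculation the discusser describes can be worked through as follows. This is a sketch under stated assumptions, not a reproduction of the printed equation: the ring thickness is written b, the gap at position z is taken as z²/r + h, the current elements are treated as parallel, and the integration limits are extended to infinity, which is reasonable only when h is much smaller than r.

```latex
% Conductance of the film as a sum of parallel elements of width dz
G = \frac{b}{\rho} \int_{-\infty}^{\infty} \frac{dz}{h + z^{2}/r}
  = \frac{b}{\rho} \, \pi \sqrt{\frac{r}{h}}

% Resistance is the reciprocal of the conductance
R = \frac{1}{G} = \frac{\rho}{\pi b} \sqrt{\frac{h}{r}}
```

Under these assumptions R varies as the square root of h, so a plot of log R against log h would have a slope of one half; a measured curve of another shape would point to a resistivity that varies with film thickness or to a departure from the assumed geometry.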


H. Blok⁹

The electrical resistance method was perhaps the first ever to 
be tried with some degree of success¹⁰ for measuring thickness of
lubricant films. But, as the authors have pointed out, this 
method, owing to certain drawbacks, has become increasingly 
unpopular. The authors deserve much merit for reviving the 
method by introducing the artifice of an additive that decreases 
the electrical resistance of the test oil by several orders of magni- 
tude, and thereby removes what have previously been perhaps 
the worst drawbacks of the method. 

There remains, however, one important question, that is, 
about the correctness of the authors’ interpretation of the 
resistances measured in their various tests in terms of minimum 
film thicknesses. This question is all the more important
because their concurrent paper¹¹ is also subject to it.

In this connection it should be observed that the authors’ 
calibration measurements have been done with unloaded disks, 
in an attempt to find minimum film thickness for films between 
disks that were loaded, sometimes rather heavily. Therefore 
it should be taken into account that, even at equal minimum film 
thickness, an unloaded film between two disks, or other curved 
surfaces, will have a geometry that may be quite different from 
that obtaining with a loaded film. The reason is to be sought in 
the elastic deformations that loaded rubbing surfaces will undergo 
in their region of closest approach. Particularly at the heaviest
loads applied in the authors’ tests (see also their concurrent
paper¹¹), local elastic flattening of the disk surfaces in their
region of closest approach must have been considerable. Then
the gap, occupied by the lubricant film between the surfaces, in
that region was no longer bounded by the originally circular arcs,
but rather by two nearly parallel plane surfaces.

Unless the authors should be able to show that, even at the 
highest loads applied in their tests, elastic flattening was in- 
appreciable, their conversion of the resistance measurements in 
terms of minimum film thickness would appear to be open to 
some doubt. As can easily be shown, the authors’ minimum film 


9 Professor of Mechanical Engineering, University of Technology,
Delft, Holland.

10 See, for instance, H. Schering and R. Vieweg, “Über die
Beurteilung der Lagerschmierung nach elektrischen Messungen,”
Zeitschrift für Angewandte Chemie, vol. 39, 1926, pp. 1119-1123; see also
the many references given in this paper to earlier work.

11 Published in this issue, pp. 19-28.
thickness would otherwise be on the high, that is, optimistic
side. 

Perhaps a better approximation to the actual minimum film 
thicknesses could be obtained by assuming, at least for high 
enough loads, the gap between the disks to be planparallel and to 
have a width, measured along the periphery of the disks, equal 
to that of the contact area that would occur according to Hertz’s 
theory for no oil at all. This method, approximative as it is, 
might well result in estimates of minimum film thickness that 
are more reliable than the present ones, even though one would 
neglect edge effects such as occur on either side of the supposedly 
planparallel gap owing to the curving away of the disk surfaces. 
The electrical resistivity of the test oil, which would be needed 
in carrying out such an estimate, might be determined from any 
suitable resistivity test.¹² There might also be a possibility of
finding the electrical resistivity from the authors’ calibration 
measurements with the unloaded disks. Then a formula should 
be available that relates the electrical resistance measured to the 
gap thickness (minimum film thickness), the radii of the disks, 
and the electrical resistivity of the oil. Such a formula was, in 
fact, developed by Schering and Vieweg (see the first reference of 
this discussion) for their case of a cylindrical journal bearing 
where one cylindrical “electrode” (the bearing surface) enveloped 
another (the journal), at least partially. The formula for the 
present case, where the cylindrical electrodes are external to each 
other, can be found by essentially the same method, which, for 
instance, can be derived from potential theory by using the tech- 
nique of conformal mapping. But more direct determinations of 
the electrical resistivity of the oil are also conceivable,12 so that 
one is not wholly dependent on the authors’ calibration measure- 
ments and on the formula involved. 
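
The plane-parallel estimate proposed above can be sketched numerically. The following is a minimal illustration, not the discusser's own calculation: it assumes the standard Hertz result for the half-width of a line contact between parallel cylinders and treats the oil in the gap as a uniform slab conductor, neglecting the edge effects mentioned above. The function names, the elastic constants, and the resistivity passed in are hypothetical inputs chosen by the user; consistent units (for example in., lb, ohm-in.) are assumed throughout.

```python
import math


def contact_modulus(e1, nu1, e2, nu2):
    """Combined elastic modulus E* of two bodies (Hertz theory)."""
    return 1.0 / ((1.0 - nu1 ** 2) / e1 + (1.0 - nu2 ** 2) / e2)


def hertz_half_width(load_per_width, r1, r2, e_star):
    """Half-width of the dry Hertzian contact band between two
    parallel cylinders of radii r1 and r2 pressed together with
    the given load per unit axial width."""
    r_eff = 1.0 / (1.0 / r1 + 1.0 / r2)
    return math.sqrt(4.0 * load_per_width * r_eff / (math.pi * e_star))


def planparallel_film_thickness(resistance, resistivity, band_width, axial_width):
    """Film thickness of a plane-parallel gap from its measured
    resistance, assuming uniform conduction through a slab of
    area band_width * axial_width: h = R * A / rho."""
    return resistance * band_width * axial_width / resistivity
```

In use, the band width passed to the last function would be twice the Hertz half-width, and the resistivity would come from an independent test of the oil, as suggested in the discussion.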

Calibration measurements on disks during running, as a sup- 
plement to the present measurements on stationary disks, could 
perhaps be performed by using a technique in which a beam of 
x rays is projected onto one side of the disks and the remaining 
radiation intensity is measured on the other side, that is, where 
what is left of the beam emerges from the film. As soon as some 
such calibration would be available, one could further rely on the 
much more convenient method of the authors. 

It is hoped that the authors will succeed in developing their 
ingenious and attractive method to the extent where any reasona- 
ble doubt about its reliability is removed. 


12 J. L. van der Minne and A. Klinkenberg, “Electrical Statics in 
the Petroleum Industry. The Prevention of Explosion Hazards,” 
Elsevier, Amsterdam, Holland, 1958. In this book many additives 
are mentioned that, like the petroleum sodium sulphonate used by the 
authors, decrease the electrical resistivity of petroleum products. 
Some of these additives might perhaps be suitable also for the 
authors’ purpose. 
Authors’ Closure 


The authors wish to thank the contributors to the discussion for
their kind comments, interest, and criticism. 

In reply to the points raised by Mr. Allen, we would like to 
point out that the method of electrical conductivity was primarily
developed and used by the authors for measuring hydrodynamic
films between rotating disks (simulating gear teeth) [2].
We do agree, however, that the development of a suitable
measuring technique for film thicknesses in the range zero to 200 
microinches is essential for the study of conditions approaching 
boundary lubrication. Such a technique would evidently necessitate
more sensitive instruments. It is fortunate that a wider
choice of additives has recently become available.13

Analytical treatments of the problem of electric current flow 
through the oil film between disks, such as those mentioned by 
Professor Hersey and Mr. Holm, are based on assumptions that 
may, in part, not be quite valid.

Experiment, as mentioned in the text of the paper, implies that, 
for the thin films encountered in the tests, current flows in the
region closest to the line of nearest approach. In other words, the
extent of the effective oil film through which current flows is so
small that the actual disk surface profile across the current flow
(whether parabolic as assumed by Professor Hersey, circular as
worked out by Mr. Holm, or even flat due to elastic deformations
under externally applied load) plays a negligible role in oil-film
resistance. Consequently the resistance would be expected to have
simply a linear relationship with film thickness. 

At relatively small gaps, however, it is reasonable to expect that
the relative proportions of particles of the insulating oil and those
of the conductive additive in the mixture, through which current 
passes, would have a more significant effect on film resistance, as 
would the resistance at the boundary layer.

At relatively large gaps, on the other hand, the oil-film sus- 
pended between the disks may narrow down to the extent that 
the effective core area, through which current flows, may be re- 
duced thus leading to increase in the rate of change of resistance 
with oil-film thickness. 

It is believed that, through intensive study of factors such as 
those outlined, theory may be brought into closer agreement 
with experiment. 

The authors are particularly indebted to Professor Blok for his 
valuable suggestions regarding further development and elabora- 
tion of their method, which suggestions will be carefully con- 
sidered in relevant future work. 


13 J. L. van der Minne and A. Klinkenberg, “Electrical Statics in the 
Petroleum Industry; The Prevention of Explosion Hazards,” 1958, 
Report, Elsevier, 
Performance Characteristics of 
Lubricating Oil Film Between Rotating Disks 


A testing machine has been designed to investigate the performance of the lubricating 
oil film between two independently driven circular disks in line contact under conditions 
of combined rolling and sliding, in an endeavor to study the behavior of meshing gear 
teeth with adequate supply of lubricant. 


Test results show that the ratio of sliding to rolling velocity is a prime factor in the 
behavior of rotating disks. Values of this ratio equal to zero and unity (conditions of 
pure rolling and pure sliding, respectively) give rise to critical changes in the performance 
of the disks, being most favorable, from a lubrication viewpoint, at pure rolling (i.e., at 
the gear pitch line) and least favorable at pure sliding where scuffing of surfaces may well 
be expected.


Recent theoretical investigations do not seem to be able yet to account for salient results 


Introduction 


THE FORMATION of hydrodynamic lubricating films 
between gear teeth presents a most interesting and rather
complicated problem from both theoretical and experimental
viewpoints. 

On the theoretical side, the main difficulties seem to include 
the variation of lubricant viscosity with film temperature and 
pressure, the effect of surface roughness and deformation under 
load, the mechanism of film formation, and the extent of operative 
film between mating surfaces. 

Theoretical studies in gear lubrication were first started in 1916 
by Martin, who obtained the equation for minimum film thickness
between a rack and a pinion [11].1 Later, in 1926, Karlson 
published his work on disk lubrication [9]. Subsequent
contributors to the theory include Gatcombe [5], Weber [17], 
Cameron [1, 2, 3], McEwen [12] and Lewicki [7]. 

On the experimental side, the development of satisfactory 
apparatus for investigating the performance of lubricating film 
between gear teeth presents certain practical difficulties. It is 
evident that the use of actual gears is hampered by the high cost 
incurred in the production of highly accurate gear profiles. 
Moreover, measurements in the vicinity of tooth contact are not 
easy to make. 

These drawbacks led to the development of testing machines in 
which cylindrical disks are used to simulate gear-teeth action 
kinematically. This idealization greatly simplifies relevant in- 
vestigations. 

Numerous disk machines were designed primarily for testing 
gear materials and lubricants, of which the machines reported by 
Merritt [14], Nishihara and Kobayashi [15], Way [16], Cameron 
[2], and Evans and Tourret [4] could produce either pure rolling 
of disks or rolling combined with a constant proportion of 
sliding. On the other hand, machines developed by Meldahl 
[13], Manhajm and Mills [10], and Cameron [3] could produce 
cyclic speed variation of one of the disks. In all these machines 
both disks were driven by one motor, an arrangement which 


1 Numbers in brackets designate References at end of paper. 

Contributed by the Lubrication Division and presented at the 
Annual Meeting, New York, N. Y., November 30-December 5, 1958, 
of The American Society of Mechanical Engineers. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, February 
20, 1958. Paper No. 58-A-254. 
obtained in the present study. Further work on the subject is proceeding. 


renders the measurement of frictional drag on the individual disks 
impossible. 

A general picture of the variation of film thickness along the 
profiles of actual gears during mating period was obtained by 
Hughes [6]; no absolute values of film thickness could, however, 
be determined. 

It may be concluded from this brief survey that little attention 
has yet been devoted to the problem of fluid-film formation be- 
tween mating gear surfaces, a fact which aroused considerable in- 
terest in planning and carrying out a quantitative and systematic 
investigation of the relatively simple case of hydrodynamically 
lubricated cylindrical disks in an endeavor to determine the per- 
formance of lubricating films between continuously meshing gear 
teeth. 

The present paper is concerned with the design and develop- 
ment of a testing machine with two independently driven disks 
which can be made to run under conditions of combined rolling 
and sliding in line or point contact, fluid-film thickness being 
measured by the electrical conductivity method. 

Performance characteristics of the lubricating film between 
specimen disks (viz., minimum film thickness and friction) are 
obtained firstly, at and near pure rolling of disks and secondly, at 
and near pure sliding with surfaces of the disks in line contact. 
Test results are further compared with some recent theoretical 
studies. 


Nomenclature 


$F_{1,2}$ = frictional forces on disks 1 and 2 
$h_0$ = minimum thickness of film between disks 
$K_s$ = specific sliding ratio = sliding velocity/rolling velocity 
$K_{s1}$ = specific sliding ratio referred to disk 1 
    = $(U_1 - U_2)/U_1$ 
    = $(N_1 - N_2)/N_1$ for disks of same diameter 
$K_{s2}$ = specific sliding ratio referred to disk 2 
    = $(U_2 - U_1)/U_2$ 
    = $(N_2 - N_1)/N_2$ for disks of same diameter 
$N_{1,2}$ = rotational speeds of disks 1 and 2 
$P$ = load applied per unit axial width of disks 
$T_i$ = inlet oil temperature 
$U_{1,2}$ = circumferential velocities of disks 1 and 2 
$W$ = total external load acting on disks 
$\lambda$ = coefficient of viscosity of lubricant 
$\mu_{1,2}$ = coefficients of friction for disks 1 and 2 = $F_{1,2}/W$ 
Description of the Apparatus 


General Arrangement. A schematic arrangement of the appara- 
tus is shown in Fig. 1, general front views in Figs. 2 and 3, and a 
sectional elevation in Fig. 4. 

The two specimen disks (1) and (2) are mounted on two horizontal
shafts (3) and (4) with their axes in the same vertical 
plane. Both disks are made of case-hardened mild steel and 
measure 6 in. in diameter and 2 and 2 1/8 in. in width (for top and 
bottom disks, respectively). They are located axially on their 
shafts by means of nuts and are electrically insulated therefrom 
by perspex bushes and washers. 

The bottom shaft is driven by a 5-hp variable-speed a-c torque 
dynamometer motor (5) supported on double-row self-aligning 
ball bearings (8). The motor carries a heavy flywheel (7) which 
helps to maintain uniform disk speed under load, serves as a brake 
pulley, and also forms one face of a flexible coupling (6). The 
shaft is supported on deep-groove ball bearings (9) and carries, at 
its free end, the slip-ring assembly (11); an extra pair of bearings 
(10) identical to bearings (9) is mounted on the shaft for calibra- 
tion purposes. 

The top shaft (4) is driven independently by a 4-hp d-c motor 
(12) (speed controlled by a Ward-Leonard system), via flywheel 
(13), inclined V-belt (14), back shaft (15), and horizontal flat endless
belt (17). The top shaft is supported on double-row self- 
Fig. 1 Schematic arrangement of the apparatus 

(1) Bottom specimen disk 
(2) Top specimen disk 
(3) Bottom shaft 
(4) Top shaft 
(5) Torque dynamometer, a-c motor 
(6) Flexible coupling 
(7) Flywheel 
(8) Rolling bearings 
(9) Deep-groove ball bearings 
(10) Same as (9) for calibration purposes 
(11) Slip-ring assembly 
(12) d-c motor 
(13) Flywheel 
(14) V-belt drive 
(15) Back shaft 
(16) Journal bearings 
(17) Flat belt drive 
(18) Loading levers 
(19) Supporting levers 
(20) Spring 
(21) Counter weights 
(22) Balancing weight 
(23) Slip-ring assembly 

aligning ball bearings housed in two levers (19) pivoted at the 
back shaft and counterbalanced by a spring (20). The center of 
gravity of the shaft can be brought to the mid-plane between the 
two supporting levers by means of a balancing disk weight (22), 
which can be moved along the screwed free end of the shaft. 

The force of spring (20) balances the overhanging weight of the
levers and top-shaft assembly so that their combined center of
gravity is made to lie on the back-shaft axis midway between the
supporting levers. In this way the top shaft can be adjusted to be in 
equilibrium in the horizontal position with the two disks just 
touching with no vertical component of load acting between 
them. 

The arrangement allows the top shaft to swivel slightly in the 
vertical plane, so that the externally applied load may be trans- 
mitted along a line of contact for the specimen disks. 

A screw jack placed under the supporting levers (19) and fixed 
to the base plate of the machine, is used to lift the top-shaft as- 
sembly to clear the top disk off the bottom one. 

A second slip-ring assembly (23) is mounted on the top shaft. 
In this assembly the position and inclination of the brushes were 
so arranged as to allow for measurements to be taken for both 
directions of rotation for the shaft. 

Fig. 3 General view of top and bottom shafts 

Transactions of the ASME 

The construction of the machine also allows the back-shaft 
assembly to turn through a few degrees about a vertical axis lying 
in the central plane of the specimen disks, so that these may be 
made to touch along a line or a mid-point. 

The steady vertical load is applied to the top shaft by means of 
two levers (18) through two 180-deg journal bearings mounted 
at the ends of two independent levers, Fig. 2, pivoted at the back 
shaft and balanced at their extreme ends by a spring. 

Loads of up to 2 tons may be applied to specimen disks by 
hanging dead weights from levers (18) which are completely in- 
dependent of each other and can turn freely from pivot bearings 
fixed at the sides of the oil tank. Each lever is balanced about 
its pivot by means of a dead weight (21) attached to its extreme 
end.

Two small struts of adjustable lengths are used to transmit 
loads from levers (18) to the loading half bearings which rest on 
the top shaft. Each strut has a knife edge on the lever side (with 
V-seating in lever) and a ball on the bearing side (with conical 
seating in bearing housing), so that when the levers are set hori- 
zontally, the load will be applied to the top shaft along two 
vertical lines passing through the centers of loading bearings. 
It is worth noting, in this respect, that Merritt [14] found that 
the use of two levers would lead to more uniform load distribution 
at contact surfaces than that obtained by the use of only one lever. 
The levers are so designed as to eliminate the moment of friction 
forces at the knife edges about pivot axes, when in the horizontal 
position. 

The specimen disks are lubricated with Castrolite 20 A oil con- 
taining 4 per cent sodium-petroleum sulphonate as additive. Oil 
is supplied, under pressure, and sprayed on either or both sides of 
the disks through suitable nozzles and control valves. Both 
heating and cooling arrangements are incorporated in the system. 

Instrumentation. The film thickness between specimen disks is 
obtained at the line of closest approach by the electrical
conductivity method in which a potential is applied across the film and 
the resulting current is measured. Owing to the relatively high 
resistivity of a straight mineral oil, an additive is used to increase 
its electrical conductivity. Current flow through the film
evidently depends on both film thickness and its extent along the 
disks. It has been found, however, that the effect of film extent 
may well be neglected in the case of such thin films. Either a-c 
or d-c supply may be used with voltages not exceeding 0.75 and 
1.25 volts, respectively. 

The whole slip-ring assembly forming part of the electric circuit 
for conductivity measurements is mounted on the shaft through 
ball bearings so as to damp any variation in pressure between slip 
rings and brushes should the shaft vibrate. Slip rings are made 
of stainless steel and brushes of silver graphite. 

Current measurements are interpreted in terms of actual film 
thickness with the help of calibration curves obtained on a “pilot 
static disk apparatus,” specially designed for the purpose.2 
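
As an illustration of how a calibration curve of this kind might be used, the sketch below interpolates linearly between calibration points to convert a measured current into a film thickness. It is a hypothetical helper, not the authors' procedure: the function name is invented, the calibration pairs must be supplied by the user from the static apparatus at the same applied voltage, and linear interpolation between neighboring points is an assumption.

```python
def film_thickness_from_current(current, calibration):
    """Interpolate film thickness from a measured current.

    calibration is a list of (current, thickness) pairs with
    distinct currents, taken at the same applied voltage as the
    running test. Values outside the calibrated range are
    rejected rather than extrapolated.
    """
    points = sorted(calibration)
    if len(points) < 2:
        raise ValueError("at least two calibration points are needed")
    if not points[0][0] <= current <= points[-1][0]:
        raise ValueError("current lies outside the calibrated range")
    for (i_lo, h_lo), (i_hi, h_hi) in zip(points, points[1:]):
        if i_lo <= current <= i_hi:
            fraction = (current - i_lo) / (i_hi - i_lo)
            return h_lo + fraction * (h_hi - h_lo)
```

Refusing to extrapolate is a deliberate choice here, since the paper gives no basis for trusting the curve beyond the calibrated gaps.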

Frictional torque on the lower disk is obtained by measuring 
the balancing torque at the bottom shaft motor and allowing for 
predetermined losses in the ball bearings mounted on the shaft. 
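
The reduction of a torque reading to a coefficient of friction can be sketched as follows. This is a minimal illustration under stated assumptions: the friction force is taken to act at the disk radius (3 in. for the 6-in. disks described above), the bearing loss is a single predetermined torque, and the coefficient is the friction force divided by the total load W, as in the Nomenclature. The function and argument names are hypothetical.

```python
def friction_coefficient(balancing_torque, bearing_loss_torque, disk_radius, load):
    """Coefficient of friction on the lower disk.

    balancing_torque    : torque read at the dynamometer motor
    bearing_loss_torque : predetermined loss in the shaft bearings
    disk_radius         : radius at which the friction force acts
    load                : total external load W on the disks
    Torques, radius, and load must be in consistent units.
    """
    if disk_radius <= 0 or load <= 0:
        raise ValueError("disk_radius and load must be positive")
    friction_torque = balancing_torque - bearing_loss_torque
    friction_force = friction_torque / disk_radius
    return friction_force / load
```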

Oil temperatures at the main tank and at the outlet of the two 
spray nozzles are measured by thermocouples connected, through 
a selector switch, to a millivoltmeter. 

Rotational speeds of top and bottom shafts are measured by 
generator-type tachometers connected to voltmeters. 


Test Procedure 


The balancing disk weight was initially adjusted in position so 
that the center of gravity of the top shaft was brought to the mid- 
plane between the two supporting levers. 

The whole of top-shaft assembly and supporting levers were 
then counterbalanced about the back shaft; so were the levers 
carrying the two loading bearings about their pivots, relevant 
springs being adjusted to effect balancing. 


2 Greater detail of this apparatus and relevant investigations is 
included in the paper, “Measurement of Oil-Film Thickness Between 
Disks by Electrical Conductivity,” published in this issue, pp. 12-18. 
Fig. 4 Sectional elevation of disk assemblies 

The two specimen disks were set apart and their axes adjusted 
parallel with the help of a plane surface gage and light source. 

The top and bottom shafts were made to run at their desired 
speeds, and the top shaft was gradually brought closer to the 
bottom shaft until specimen disks touched under no external load. 
The oil supply was switched on to admit oil under pressure to the 
contacting surfaces. 

Loading levers were adjusted to the horizontal position before 
proper weights were hung therefrom to apply the desired loads to 
specimen disks. 

A constant d-c potential of 0.5 volt was, in all tests, applied 
across the film. 

The apparatus was left to run under load, the various operating 
conditions being adjusted to attain the steady state. 

A complete set of observations was taken under the same im- 
pressed conditions. This included the measurement of current 
flow through the film, frictional torque at the bottom disk, speeds 
of top and bottom shafts, and the oil-inlet temperature. 


Scope of Experimental Work 


Specimen disks are arranged to run in line contact, and their 
performance characteristics obtained for the range of specific 
sliding-ratio values of primary importance; namely, at and near 
$K_s$ = 0 and 1, which values correspond to conditions of pure 
rolling and pure sliding, respectively, Fig. 5. The study also 
included the investigation of the effect, on disk performance, of 
inlet-oil temperature, applied load, and disk speeds. 

Fig. 5 Disk speed conditions leading to pure rolling and pure sliding 

Fig. 6 Typical curves of minimum film thickness at pure rolling; effect of inlet-oil temperature 

Operating conditions were interchanged for the disks since 
frictional torque could only be measured at the bottom disk. 
Generally two tests were made, each arranged with one of the 
disks running at constant speed. 

Four values for the inlet-oil temperature were used in the ex- 
periments; namely, about 65, 85, 100, and 120 F. 

Loads applied to the disks, under conditions of pure rolling, 
were 107, 212, 332, 497, 632, and 812 lb. Higher loads, though 
possible to exert by the loading system, were avoided in the tests, 
on account of the large increase in oil temperature, the excessive 
reduction in film thickness, and the undesirable prolongation in 
time needed to reach the steady state. Under conditions of pure 
sliding, only very light loads (namely, 22, 47, 77, and 107 lb) were 
used, so as to avoid scuffing of contacting surfaces. 

Disk speeds could each be varied satisfactorily in the range 100 
to 450 rpm. The constant-speed disk could be either the top or 
bottom one when investigating pure rolling, and could only be 
the bottom disk when investigating pure sliding; this is due to 
the fact that bottom-shaft speed could not be varied between 100 
and —100 rpm with the existing motor. 

The speed of the constant-speed disk is referred to in this paper 
as “pure-rolling speed” since pure rolling of disks is only achieved 
when the variable-speed disk is made to rotate at the same rate 
as that of the constant-speed disk. 


Operation at and Near Pure Rolling 


Combined rolling and sliding were obtained in the experiments 
through speed variation of the two independently driven disks, 
pure rolling being attained when their speeds become equal in 
magnitude and opposite in sense, Fig. 5. 
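
The specific sliding ratios defined in the Nomenclature can be evaluated as in the sketch below. It assumes a sign convention that is not spelled out in the paper: both surface speeds are measured in the same direction at the line of contact, so that equal values mean pure rolling and a reversed disk has a negative speed. For disks of the same diameter, rotational speeds in rpm may be passed in place of surface speeds under the same convention. The function name is hypothetical.

```python
from typing import Optional, Tuple


def specific_sliding_ratios(u1: float, u2: float) -> Tuple[Optional[float], Optional[float]]:
    """Return (K_s1, K_s2) for surface speeds u1 and u2.

    K_s1 = (u1 - u2) / u1 is referred to disk 1 and
    K_s2 = (u2 - u1) / u2 is referred to disk 2. A ratio is
    returned as None when its reference disk is stationary,
    since the ratio is then undefined.
    """
    ks1 = (u1 - u2) / u1 if u1 != 0 else None
    ks2 = (u2 - u1) / u2 if u2 != 0 else None
    return ks1, ks2
```

With disk 1 as the constant-speed disk, equal speeds give a ratio of zero (pure rolling), a stationary disk 2 gives unity (pure sliding), and a reversed disk 2 gives values above unity.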

Performance characteristics studied included the minimum film 
thickness $h_0$ and the frictional drag on the disks. A summary and 
a brief discussion of the salient results obtained follow. 

Minimum Film Thickness. Typical curves of minimum film thickness
exhibit peak values at pure rolling ($K_s$ = 0), Figs. 6 and 8, 
film-thickness values receding therefrom as the specific sliding 
ratio deviates from the zero value. In general, the curves are not 
symmetrical about the ordinate $K_s$ = 0. This is partly due to the 
effect of absolute disk speed on film thickness (as will be shown), 
since the speed of the variable-speed disk is not the same at
$K_s = \pm x$. Values of $h_0$ at the higher range of negative specific
sliding-ratio values are generally higher than those at the
corresponding positive $K_s$-values, the variation in $h_0$ in the
former range being very gradual or even nil. 

Fig. 7 Minimum film thickness at pure rolling 

Fig. 8 Typical curves of minimum film thickness at pure rolling; effect 
of applied load 

Fig. 9 Effect of applied load on minimum film thickness 

Fig. 10 Effect of pure-rolling speed on minimum film thickness 

For the same inlet-oil temperature, applied load, and pure-rolling
speed, and within the range of specific sliding-ratio values 
tested, the minimum film thickness is found to assume its lowest 
value at the greatest positive $K_s$-value, and to attain its
maximum value at pure rolling. Lowest and highest values obtained 
for $h_0$ in the tests are approximately 0.000025 and 0.008 in., 
respectively. 

Curves of minimum film thickness against inlet-oil temperature, 
Fig. 7, show a gradual decrease in $h_0$ with increase in $T_i$, the 
temperature effect diminishing with higher positive $K_s$-values, 
Fig. 6, and with heavier loads. This may be due to the fact that, 
under these operating conditions, the actual mean oil temperature 
in the film is so high that the change in inlet temperature has a 
small effect on viscosity and hence on film thickness. 

Test results show, as would be expected, that the greater the 
applied load, the smaller will be the minimum film thickness, Figs. 
8 and 9, and the more symmetrical will be the $h_0$-$K_s$ curve about 
the ordinate $K_s$ = 0, especially at higher inlet-oil temperatures. 
This latter feature implies that, at high loads and for the same 
inlet-oil temperature, and with one of the disks kept running at 
constant speed, $h_0$ will be almost totally dependent on the 
numerical value of $K_s$ (relative to the constant-speed disk)
irrespective of the actual rotative speed of the variable-speed disk. 

Experiment also shows that, for the same specific sliding ratio,
applied load, and inlet-oil temperature, the minimum film thickness 
increases with higher disk speeds, a trend in favor of Cameron’s 
theoretical predictions [1]. This speed effect is found to be more 
pronounced at the smaller values of $K_s$, being greatest at pure 
rolling, Fig. 10, and to be less significant for heavier loads and at 
higher oil temperatures. 

Test results are summarized in Fig. 11, in which the minimum 
film thickness $h_0$ is plotted against the nondimensional term 
$[P/\lambda(U_1 + U_2)]$ which appears in relevant literature. It can be 
seen therefrom that, for the same specific sliding ratio, this
nondimensional term provides a good means of correlation for test 
points obtained with various combinations of applied load,
lubricant viscosity, and disk speeds. 

Fig. 11 Minimum film thickness $h_0$ versus the nondimensional term 
$[P/\lambda(U_1 + U_2)]$ 
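
For reference, the nondimensional term used for the correlation can be evaluated as sketched below. The helper names are hypothetical, and consistent units are assumed: for example, load per unit width in lb/in., viscosity in lb-sec/sq in., and surface speeds in in./sec, for which the ratio has no dimensions. The conversion from rpm to surface speed uses the disk diameter.

```python
import math


def surface_speed(rpm, diameter):
    """Circumferential speed of a disk (length units per second)
    from its rotational speed in rpm and its diameter."""
    return math.pi * diameter * rpm / 60.0


def load_term(load_per_width, viscosity, u1, u2):
    """Nondimensional term P / [viscosity * (U1 + U2)].

    load_per_width is the load per unit axial width P; u1 and u2
    are the circumferential speeds of the two disks.
    """
    total_speed = u1 + u2
    if viscosity <= 0 or total_speed <= 0:
        raise ValueError("viscosity and U1 + U2 must be positive")
    return load_per_width / (viscosity * total_speed)
```

Grouping load, viscosity, and speed in this way is what allows test points from different operating conditions to be compared at the same specific sliding ratio.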

Recent theoretical investigations [3, 5, 17] do not seem to be 
able yet to account for the critical increase in film thickness at 
pure rolling, Fig. 12. It is thought, however, that closer agree- 
ment between theory and experiment may be achieved through 
more intensive study of the mechanism of film formation between 
the disks. 

The results agree with Hughes’ findings [6] in that the film 
thickness between meshing teeth of rotating gears increases with 
reduction in the applied load and oil temperature. They also 
agree in that the film thickness is greatest under pure rolling (in 
the neighborhood of the pitch line) and is smallest when sliding is 
maximum. 

Friction. It is found that, within the present range of
experiments, the frictional forces on the two disks, except at pure
rolling, always act in opposite directions and, in general, have
different magnitudes. Each force acts on the respective disk in a 
direction opposite to that of its sliding velocity. 

Fig. 12 Comparison of minimum film-thickness results with theory 

Further, it can be easily shown, by a simple force analysis, that 
$F_1$ should be greater than $F_2$ when $N_1 > N_2$ and vice versa;
otherwise the resultant of film pressures would act at some point in 
the divergent part of the film. 

The frictional force is herein taken positive when acting in a 
direction opposite to that of the motion of the respective disk sur- 
face. 

Typical curves of the coefficient of friction against specific 
sliding ratio, Figs. 13 and 14, show that the frictional forces $F_1$ 
and $F_2$ decrease as pure rolling is approached, and reverse
directions while passing through pure rolling at which condition they 
vanish. Friction values become fairly constant at $K_s$-values 
beyond about -0.3. 

Fig. 13 Typical friction results at pure rolling; effect of inlet-oil
temperature 

Fig. 14 Typical friction results at pure rolling; effect of applied load 

Test results thus confirm the analytical finding just given in 
that, for the same inlet-oil temperature, applied load, and constant 
speed for one of the two disks, $F_1$ is always greater than $F_2$ when 
$N_1 > N_2$ and vice versa. At pure rolling, where $F_1 = 0 = F_2$, the 
resultant of film pressures would be expected to coincide with the 
line of centers. 

Experiment shows that friction increases with rise in oil
temperature and decreases with increase in disk speeds (for the same 
$K_s$-values), and that the coefficient of friction is nearly
independent of the applied load; this latter result is in agreement with 
Cameron’s findings [1]. 

Curves of $\mu$ against the nondimensional term $[P/\lambda(U_1 + U_2)]$, 
Fig. 15, show an increase in $\mu$, at first, followed by fairly constant 
values. The nondimensional term seems to give, for the same 
$K_s$-values, a good correlation between results obtained under 
various operating conditions.
Experiment disagrees with Weber’s [17] and Cameron’s [3] 
predictions, Fig. 16, in that friction forces on the two disks always 
act in opposite directions and in that they vanish at pure rolling. 


Operation at and Near Pure Sliding 


Performance characteristics are obtained for the interesting 
case in which the variable-speed disk is made to reverse its
direction of rotation, thus passing through the pure-sliding condition, 
$K_s$ = 1.0, at which the variable-speed disk ceases to rotate, Fig. 5. 

In the first test run, the upper disk was held stationary by 
applying the brake to the drum of the upper disk motor while the 
lower disk was made to run at 300 rpm. As soon as a load of 40 lb 
was applied to the disks, seizure took place and the upper disk was 
driven by the bottom one against the force of the brake. Scuffing 
was found to develop on both disks along an axial band extending 
over most of their widths, Fig. 17. The disks were also damaged 
at several points near the edges. The fact that scuffing de- 
veloped along an axial band inspires confidence in the design and 
adjustment of the apparatus to achieve line contact between 
specimen disks. 


Fig. 17 Scuffing on upper disk at pure sliding 


The disks had to be reground, their diameters being reduced by 
0.017 in. to remove all traces of scuffing. Pure sliding was at- 
tained thereafter by running the upper disk at a positive rolling 
velocity, decreasing it gradually to zero, and then increasing it in 
the opposite sense. The upper disk speed could be varied between 
100 and —100 rpm by inserting additional pulleys between the 
upper disk motor and the back shaft. Unfortunately, the lower 
disk speed, in the present setup, could not be reduced to the same 
range, and consequently $F_2$ could not be measured. 

In this series of tests, the lower disk was made to run at the 
desired constant speed, while the upper-disk speed was varied 
between 100 and -100 rpm. The minimum film thickness $h_0$ 
and the frictional force on the lower disk $F_1$ were measured under 
the same impressed conditions. To safeguard against damage to 
disk surfaces, the applied load was removed whenever the
minimum film thickness reached about 0.00002 in. 

Minimum Film Thickness. Test results show that $h_0$ is greatly 
reduced as the pure-sliding condition is approached, Figs. 
18, 19, and 20. It is found to increase, however, as the
variable-speed disk starts to rotate in the opposite sense, and to become
almost constant at $K_s$-values beyond about 1.1. 

Test results also show that, for the same $K_s$-value, the minimum 
film thickness increases with increase in rolling speed and with
reduction in inlet-oil temperature and applied load. 

Friction. Curves of friction coefficient (for the constant-speed 
disk) versus specific sliding ratio, Figs. 21 and 22, exhibit peak 
values at the pure-sliding condition, followed by constant values 
beyond $K_s \approx 1.1$. 

Friction is found to increase with rise in oil temperature, and 
with reduction in rolling speed. The coefficient of friction ap- 
peared to be nearly independent of the applied load. 


Summary and Conclusion 


Performance characteristics for two independently driven 
circular disks in line contact are obtained under conditions of combined 
rolling and sliding with adequate supply of lubricating oil. 
Typical performance curves show that, within the range of 
present tests, two critical changes are noted; one at pure rolling 
(K, = 0) where the oil film is thickest and the frictional forces are 
nil, and the other at pure sliding (K, = 1) where the film is thinnest 
and frictional forces are greatest, e.g., Fig. 23. 

Fig. 18 Minimum film thickness at pure sliding; effect of inlet-oil temperature 

Fig. 19 Minimum film thickness at pure sliding; effect of applied load 

Fig. 20 Minimum film thickness at pure sliding; effect of rolling speed 
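
As a hedged illustration of the two critical conditions named above, the sketch below classifies an operating point from the surface speeds of the two disks. The definition used, sliding velocity divided by the surface velocity of the constant-speed disk, is an assumption chosen only because it reproduces the values quoted here (0 at pure rolling, 1 with one disk stationary, above 1 for counter-rotation). The function names are hypothetical and the definition used in the tests may differ.

```python
def specific_sliding_ratio(u_const: float, u_var: float) -> float:
    """Assumed definition: (u_const - u_var) / u_const.

    u_const: surface speed of the constant-speed disk (must be non-zero).
    u_var:   surface speed of the variable-speed disk; negative values
             mean rotation in the opposite sense.
    """
    if u_const == 0:
        raise ValueError("constant-speed disk must be rotating")
    return (u_const - u_var) / u_const


def regime(k: float, tol: float = 1e-9) -> str:
    """Name the running condition for a ratio under the assumed definition."""
    if abs(k) <= tol:
        return "pure rolling"
    if abs(k - 1.0) <= tol:
        return "pure sliding"
    return "combined rolling and sliding"


if __name__ == "__main__":
    # Equal-diameter disks are assumed, so rpm can stand in for surface speed.
    for n_var in (300, 100, 0, -30):
        k = specific_sliding_ratio(300, n_var)
        print(f"N_var = {n_var:5d} rpm  K = {k:.2f}  {regime(k)}")
```

With equal disk diameters the rpm values can be passed directly; for unequal disks the surface speeds would have to be computed first.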

For the same specific sliding-ratio value, the minimum film 
thickness ho is found, in general, to increase with increase in oil 
viscosity and disk speeds, and to decrease with the application of 
higher loads. On the other hand, friction is shown to decrease 
with increase in oil viscosity and disk speeds, and to increase with 
load at an almost uniform rate; i.e., coefficient of friction nearly 
independent of load. 

In general, frictional forces on the two disks are found to be 
different in both magnitude and sense. Each force acts on its 
disk in a direction opposite to that of its sliding velocity. The 
two forces vanish at pure rolling and increase sharply with positive 
or negative values of specific sliding ratio. Friction values 
become fairly constant beyond K, = -0.3. 

Fig. 21 Friction at pure sliding; effect of inlet-oil temperature 

Fig. 22 Friction at pure sliding; effect of rolling speed 

Transactions of the ASME 

Test results, in general, do not confirm recent theories put for- 
ward by Gatcombe [5], Weber [17], and Cameron [3], which 
theories do not predict the experimentally revealed critical 
changes in performance of disks at pure rolling and pure sliding. 
Closer agreement between theory and experiment may well be 
achieved through more intensive study of film formation between 
the disks. 

Test results agree, however, with Hughes’ findings [6] in that 
the film thickness between meshing teeth of rotating gears is 
greatest in the neighborhood of the pitch line, i.e., at pure 
rolling, and is smallest when sliding is maximum. 

Pure sliding, obtained in the tests with one of the disks station- 
ary, is shown to result in a serious reduction in film thickness and 
a related increase in friction; these severe conditions may well 
lead to scuffing of disk surfaces. 

It may be concluded therefore that, with adequate supply of 
oil, the disk surfaces may be separated by a hydrodynamic film 
of lubricant, conditions being most favorable for film formation 
when specific sliding is zero (at gear pitch line), and least favorable 
when one of the disks ceases to rotate. 

DISCUSSION 
H. Blok³ 


In the writer’s discussion on the authors’ concurrent paper⁴ 
some doubt has been expressed about the correctness of the 
authors’ method of interpreting their electrical resistance measurements 
in terms of minimum film thicknesses. Therefore, although 
it is tempting to embark upon a correlation of the authors’ results 
with those recently obtained also by others,⁵,⁶ it is perhaps 
wiser to await the authors’ reply and the corrections that might 
possibly have to be applied. For the same reason, the writer will 
confine himself to raising only the following two points. 

In his afore-mentioned discussion the writer has gone into the 
question of how to account approximately for the effect of elastic 
flattening of the rubbing disks in their region of closest approach, 
and thus, for a presumably significant effect of load, in estimating 
minimum film thicknesses on the basis of the electrical resistances 
measured. Viewed in that light, it would appear that the 
authors’ estimates may well be on the optimistic side, and the 
more so the higher the load. In other words, the sensitivity of 
minimum film thickness toward variations in load is expected 
to be less than would appear from Figs. 7 and 8. Particularly at 
the highest loads applied, this sensitivity might well be very 
much less. It would even appear that the possibility cannot yet 
be ruled out that in that load range minimum film thickness 
would eventually prove to be nearly independent of load (compare 
the results obtained by a capacitance method under somewhat 
comparable conditions, by A. W. Crook⁵). 

³ Professor of Mechanical Engineering, University of Technology, 
Delft, Holland. 

⁴ Published in this issue, pp. 12-18. 

⁵ A. W. Crook, discussion to Paper No. 33, “Lubrication as a Gear 
Design Factor,” presented by H. Blok before the International Conference 
on Gearing, The Institution of Mechanical Engineers, London, 
England, September 23-25, 1958. 

⁶ Yu. A. Misharin, “The Influence of the Rubbing Conditions on 
the Magnitude of the Friction Coefficient in the Case of Rolling With 
Sliding,” International Conference on Gearing, The Institution of 
Mechanical Engineers, London, England, September 23-25, 1958. 

The second and last point to be raised relates to the friction 
measurements. The authors have stated that under pure rolling 
conditions friction vanishes. This is indicated also in Figs. 13, 14, 
and 16, in which the curves depicting the trend of coefficient of 
friction have been drawn right through the origin. Have the 
authors perhaps meant to say, more exactly, that friction was so 
small that it could not be measured? It can in fact be proved, 
on the basis of the law of conservation of energy, that under fully 
hydrodynamic conditions rolling friction can never vanish in any 
exact sense. The explanation is that the lubricant will flow in the 
supposedly hydrodynamic film so that mechanical energy will be 
converted by viscous shear into heat. However, input of the 
mechanical energy, required for maintaining the flow, can take 
place only in the form of shearing stresses, or say nonzero fric- 
tion, which, owing to the symmetry under rolling conditions, will 
be exerted by either disk on the film to equal amounts. Sliding 
conditions are conceivable, however, possibly far from pure roll- 
ing, where one of the disks supplies all the mechanical energy to 
the film, so that the other is frictionless. 


Authors’ Closure 


The authors agree with Professor Blok in that their results 
obtained on the Static Disk Apparatus do not conform in every 
respect to conditions prevailing in the Rotating Disks Testing 
Machine, particularly in connection with the effect of load on 
disks’ surface profiles and consequently on oil-film resistance. As 
mentioned in our concurrent paper, experiments seem to imply 
that, for thin films encountered in the tests, electric current ap- 
pears to flow in such an extremely narrow passage that actual 
disk surface profile across current flow would not affect its elec- 
trical resistance to any appreciable extent; specimen disks used 
in the tests were, moreover, case-hardened. The interpretation 
of electrical resistance measurements in terms of minimum film 
thicknesses could not, in the authors’ opinion, escape criticism 
entirely. Further investigations in this respect would certainly 
throw more light on the effect of disks’ geometry on the electrical 
resistance of the oil film. 

As far as friction measurements are concerned, the authors are 
quite aware of the fact that, in theory, viscous drag on disks under 
pure rolling does not really vanish. In present experiments, 
viscous torque could only be measured at the bottom specimen 
disk. Under pure rolling conditions, the bottom shaft dynamometer 
appears completely free from any torque; moreover, frictional 
torque changes sign as the “Pure Rolling” condition is 
being passed. It is evident that a comprehensive analytical 
study of the mechanism of film formation between disks is still 
to be sought. 

The Measurement of Oil-Film 
Thickness in Gear Teeth 


The voltage drop across thin oil films when a constant current of 1 amp is passed, i.e., 
the discharge voltage, is used to measure the oil-film thickness between loaded gear teeth 
while running. It is found that the thickness at the pitch line is between 1 and 4 × 10⁻⁴ 
in., which varies slightly with the viscosity and rather more strongly with load. The 
thickness at the tips and roots is very dependent on the tip relief. The conditions here 


may explain the difference between disk and gear tests. These experimental values are 


ONE OF THE major troubles encountered in any work 
on gears is the difficulty in getting near the line of contact and 
the consequent problem of making any measurements there. 
Many investigators have tried to use electrical resistance as a 
means of deciding on the type of lubrication found in gears, but 
it is generally recognized that it is impossible to obtain any 
quantitative data from these measurements [1-3].¹ 


Experimental Study 


An experimental study of the passage of an electric current 
through a thin oil film shows that it very soon ceases to obey 
Ohm’s law (Brix [4] and Cameron[5]). As soon as the current 
reaches about '/, amp the voltage drop across the oil film becomes 
independent of the current. The voltage was shown to be sen- 
sibly constant from '/, to 350 amp (Cameron [5]) for rotating 
disks. This is the basis of the method of measuring oil-film 
thickness described in this paper. 

The actual value of this discharge voltage is found to vary with 
the thickness of the film [4]. Once the calibration constant of 
voltage against oil-film thickness has been determined, this 
method can be used for measuring oil-film thickness in any type 
of machinery and its application to gears is described here. All 
that is needed to determine the film thickness is a simple measurement 
of the voltage. In general, there will be a pure ohmic resistance 
associated with the circuit and hence it is usually desirable 
to keep the current constant so the pure ohmic voltage drop of the 
leads, slip-rings, and so on, is a known constant which can be sub- 
tracted from the over-all voltage drop. For this reason the ohmic 
or metallic resistance is kept as low as possible by the use of thick 
electric leads, and the like. 

The calibration of the discharge voltage/oil-film thickness 
constant has been described in detail by the authors and their 
co-workers [6 and 7]. It is found that the constant is independ- 
ent of the viscosity of the oil from a given type of crude and 
method of refining. The effect of different types of oil with more 
or less saturated molecules or rings has not yet been studied. 

The oil used had a calibration constant of 4 volts per 0.001 in. 
The oil was recalibrated at intervals during the work and this 
figure did not change. 
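
The conversion described above can be sketched in a few lines: subtract the known ohmic drop of the leads and slip rings from the over-all voltage, then divide by the calibration constant. This is a minimal sketch rather than the authors' procedure; the function and parameter names are hypothetical, the 1-amp current is the value given in the abstract, and the 0.03-ohm circuit resistance is the figure quoted in the description of the apparatus.

```python
# Calibration constant from the text: 4 volts per 0.001 in. of oil film.
CAL_VOLTS_PER_INCH = 4.0 / 0.001


def film_thickness_in(measured_volts: float,
                      current_amp: float = 1.0,
                      ohmic_resistance_ohm: float = 0.03,
                      cal_volts_per_inch: float = CAL_VOLTS_PER_INCH) -> float:
    """Return oil-film thickness in inches from the over-all voltage drop.

    The constant ohmic drop (current x metallic resistance) is removed first;
    a negative remainder is treated as zero film, i.e. "contact".
    """
    ohmic_drop = current_amp * ohmic_resistance_ohm
    discharge_volts = max(measured_volts - ohmic_drop, 0.0)
    return discharge_volts / cal_volts_per_inch


if __name__ == "__main__":
    for volts in (0.03, 0.43, 1.63):
        print(f"{volts:.2f} V over-all -> {film_thickness_in(volts):.2e} in.")
```

Keeping the current constant is what makes the ohmic term a fixed offset, so only one subtraction is needed per reading.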

Details are given in the paper on calibration [6] of the effect 
of dirt particles. It was found that if these were removed carefully 
by an edgewise paper filter the constant rose to 12 volts per 
0.001 in. 

¹ Numbers in brackets designate References at end of paper. 

Contributed by the Lubrication Division for presentation at the 
Annual Meeting, New York, N. Y., November 30-December 5, 1958, 
of The American Society of Mechanical Engineers. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those 
of the Society. Manuscript received at ASME Headquarters, September 
2, 1958. Paper No. 58-A-142. 

compared with theoretical work and are shown to be of the same order of magnitude. 

It must be emphasized that the discharge voltage is quite 
different from the resistance of the oil. The apparatus needed to 
measure the two is nearly the same, but in the method described 
here the voltage required to maintain a steady discharge (which 
is defined as the discharge voltage) is measured. The current is 
kept sensibly constant, which is dictated only by the need to keep 
the ohmic voltage drop in the electric leads constant. 

The voltage required to start the discharge or arc is of course 
higher than the one needed to maintain it. It is thought that it 
is started by adventitious dirt particles. 

The fact that the calibration of the oil was repeated several 
times during the work and found to be constant avoids a number 
of unanswered questions, such as the effect of oil composition, 
dirt, moisture, and so forth. These are all being studied in this 
laboratory. 

The object of the paper is to describe the application of this 
voltage-discharge method to the study of the lubrication of gears 
while running under load. The oil-film thickness so measured 
will be correlated with published theoretical work. 

In so far as we know the only analysis of the problem, taking 
into account the pressure viscosity effect, and the deformation of 
the gear teeth, has appeared in Russia, by Grubin and Vinogra- 
dova [8], and Petrusevich [9]. A certain amount of work has been 
published on the pressure-viscosity effect alone, Gatcombe [10] 
and Cameron [11], or on the deformation of gear teeth by a constant-viscosity 
lubricant, Dörr [12] and Poritsky [13]. The combination 
of the two is essential and an accurate solution of the 
problem is now the outstanding question in the theory of gear 
lubrication. The Russian work, while a useful pointer, contains 
a number of approximations which limit its accuracy. 


Apparatus 


The apparatus is a conventional 5-inch-centers back-to-back 
gear-testing machine. The loading gears are mounted in a 
framework which can swing around the axis of one of the shafts. 
The static torque applied to this frame is transferred by the 
epicyclic action of the gear in the swinging frame into a circulating 
torque in the system, Fig. 1. 

Two sets of straight spur gears were used. The first set 
consisted of 19/40 teeth and the second 30/30, the high-speed 
shaft being directly coupled to a 2920-rpm 6-hp motor. The 
gears in both cases were to BSS proportions, with a pressure angle 
of 22 1/2 deg for the 19/40 set and 20 deg for the 30/30 and an 
addendum of 0.3183 CP and dedendum of 0.4583 CP. The 
19/40 gears were of EN34 2 per cent Ni-Mo, case-hardened 
steel of 0.12 to 0.20 per cent carbon, and of 6 DP. The small 
gear was 3 1/2-in. OD. The face width was 7/j. in. 

Fig. 1 Diagram of gear testing machine 

MARCH 1960 / 29 

The 30/30 gears were of EN 32 case-hardened steel, 0.15 per 
cent carbon, and the OD was 5 1/3 in. 
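
The quoted tooth proportions can be cross-checked with a short calculation. The sketch below is illustrative only: it takes the addendum and dedendum factors from the text, uses the standard relation circular pitch = pi / diametral pitch, and assumes 6 DP for the 30/30 set as well (the text states 6 DP only for the 19/40 set; 6 DP is what 5-in. centres imply for 30/30). The function name and the dictionary keys are hypothetical.

```python
import math


def spur_gear_geometry(n_pinion: int, n_wheel: int, diametral_pitch: float) -> dict:
    """Basic dimensions, in inches, for a pair of spur gears.

    Addendum = 0.3183 CP and dedendum = 0.4583 CP, as quoted in the text.
    """
    circular_pitch = math.pi / diametral_pitch
    addendum = 0.3183 * circular_pitch
    dedendum = 0.4583 * circular_pitch
    pitch_dia_pinion = n_pinion / diametral_pitch
    pitch_dia_wheel = n_wheel / diametral_pitch
    return {
        "circular_pitch_in": circular_pitch,
        "addendum_in": addendum,
        "dedendum_in": dedendum,
        "centre_distance_in": 0.5 * (pitch_dia_pinion + pitch_dia_wheel),
        "od_pinion_in": pitch_dia_pinion + 2.0 * addendum,
        "od_wheel_in": pitch_dia_wheel + 2.0 * addendum,
    }


if __name__ == "__main__":
    for teeth in ((19, 40), (30, 30)):
        g = spur_gear_geometry(*teeth, diametral_pitch=6.0)
        print(teeth, {k: round(v, 4) for k, v in g.items()})
```

Since 0.3183 is 1/pi to four places, the addendum works out to 1/DP, which is why the outside diameter follows the familiar (N + 2)/DP rule.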

In both cases the surface roughness was */.; microinch average. 

The tip relief also was varied. The 19/40 gears had profiles 
as shown in Fig. 2(a), and the 30/30 in Fig. 2(b). A pure involute 
would give a straight line, and the deviation from the line 
shows the amount of tip relief. These different tip reliefs had a 
profound effect on the oil-film thickness, as will be shown later. 
The oils used are listed in Appendix 1. The temperatures were 
not controlled, but measured by a thermometer placed near the 
(outgoing) meshing line. Oil was fed by gravity to the gears. 

Fig. 2 Traces of tooth form. (A) 0.0003 in. tip relief; (B) 0.001 in. tip 
relief. Straight line represents pure involute. 
The only modification to the apparatus was to provide insulation 
to one gear-wheel shaft. This was achieved by insulating 
both the coupling in the swinging shaft with nylon bolts and a 
nylon facing, and also the two ball bearings. This latter was 
done by boring out the bearing housings by '/s in. on the diameter 
and casting in '/s in. thickness of hard rubber, which was then 
ground back to the original diameter. The same treatment was 
carried out on the locating face of the housings. In this way 
the hardened splines holding the gear on the shaft were not altered 
and standard gears could be used. It is important to be able to 
use standard gears in all test machines of this kind. 

Fig. 3 Oil-film thickness, with various loads, viscosity constant 

The current is led in and out of the machine by attaching '/s in. 
diam springs to each of the stub ends of the shaft. These springs 
are led through a hole into a small container filled with mercury, 
which provides a steady, low resistance contact. In this way 
there was a minimum of resistance to the passage of the current 
in the circuit. Two contacts were used, one on each shaft to 
insure that the system was perfectly grounded. 

A timing mark is obtained by placing a small horseshoe electromagnet 
very near the gear teeth. The distance between the two 
legs of the magnet equaled the circular pitch of the gear teeth. 
The signal from the windings, when amplified, gave a trace on 
the CRO. There was also a microswitch coupled to a 40:1 
reduction (in the case of the 40/19 gears) which blanked out the 
tube, except for a short period every 40th revolution. This en- 
abled the same set of teeth to be viewed each time. The period 
of contact of the microswitch allowed approximately five com- 
plete meshings to be photographed. 
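
The reason a 40:1 reduction shows the same set of teeth each time can be illustrated with the tooth counts alone. The sketch below is a hedged example with hypothetical function names; it assumes that the 19-tooth gear runs on the 2920-rpm shaft and that the reduction is driven from that shaft, neither of which the text states explicitly.

```python
from math import gcd


def revolutions_until_repeat(teeth_this: int, teeth_mate: int) -> int:
    """Revolutions of 'this' gear before the same pair of teeth meets again."""
    return teeth_mate // gcd(teeth_this, teeth_mate)


def mesh_frequency_hz(teeth: int, shaft_rpm: float) -> float:
    """Tooth engagements per second for a gear of 'teeth' teeth at 'shaft_rpm'."""
    return teeth * shaft_rpm / 60.0


def window_for_meshings_s(n_meshings: int, teeth: int, shaft_rpm: float) -> float:
    """Viewing time needed to capture 'n_meshings' successive tooth engagements."""
    return n_meshings / mesh_frequency_hz(teeth, shaft_rpm)


if __name__ == "__main__":
    print("19-tooth gear revolutions per repeat:", revolutions_until_repeat(19, 40))
    print("40-tooth gear revolutions per repeat:", revolutions_until_repeat(40, 19))
    print("mesh frequency, Hz:", round(mesh_frequency_hz(19, 2920), 1))
    print("window for five meshings, s:", round(window_for_meshings_s(5, 19, 2920), 5))
```

Because 19 and 40 share no common factor, a given tooth pair meets only once in every 40 revolutions of the 19-tooth gear, which matches the blanking interval described.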

The electrical circuit was quite simple. A double-beam, cath- 
ode-ray oscilloscope was used, the time base being synchronized 
by the time marker. The trace from the marker was displayed by 
one beam, and the discharge voltage on the other. The constant 
current was obtained by connecting a 6-volt car battery in series 
with a slider resistance. The variation in discharge voltage was 
so small that there were negligible fluctuations in the current. 
It is for this reason that the ohmic resistance of the circuit was 
kept low—in this case 0.03 ohm. 

The discharge voltage, appearing across the two mercury contacts, 
was fed directly into the amplifier of the oscilloscope; the 
amplification being known, the voltage could be measured. 

The trace from the time marker enabled the position of the 
mesh to be determined. The shape of the discharge-voltage 
curves agreed with the time marker, in that the minimum oil film 
was at the tips and the maximum film was at the pitch line. This 
will be discussed in the next section. 

² A description of sliprings very similar to those used here has just 

Fig. 5 Effect of larger tip relief; same conditions as Fig. 3 


Results 


It has been established that voltage is proportional to oil- 
film thickness; hence a measurement of this voltage gives the 
oil-film thickness directly. The time marker enables the state 
of the mesh at any instant to be determined. 

Typical results are shown in Figs. 3, 4, and 5. In all these 
photographs the ordinate is voltage; i.e., oil-film thickness. 
The abscissa is time and, on the photographs, 
the pitch points at successive contacts are marked. 

There is very considerable “hash” or scatter in the oil-film- 
thickness (voltage) reading. This is considered to be due to the 
surface roughnesses or dirt. On several photographs this 
scatter was equal to 25 microinches, which was approximately 
equal to the average surface roughness of the teeth. 

These photographs therefore enable the oil-film thickness 
during the contact to be read off directly, within the uncertainty 
of the scatter and, of course, of the accuracy of the calibration. 

Typical photographs are shown in Fig. 3. Here the load is 
varied, and the viscosity of the oil used (No. 3) is kept constant. 
This is always a difficult matter in gear testing as the frictional 
energy dissipated in the mesh is dependent on the load. To 
minimize the variation of temperature, the machine was run at a 
medium constant load till steady conditions were attained and the 
load varied as quickly as possible. 

Calibration carried out on the apparatus described previously 
[6] showed that 4 volts corresponded to 0.001 in. of oil film. 
Hence both voltage and corresponding oil-film thickness are 
shown on the figure. The diagram shows that as the load is in- 
creased the contact at the tips and roots increases considerably. 
We define ‘‘contact’’ as that state of lubrication where we can 
measure no oil film. This increase in contact at the tips and 
roots is especially clear in the photograph of the 1800-lb-per-in. 
load. 

At the pitch line the oil film is very considerable, about 10⁻⁴ 
in. This does not decrease very much with increasing load. 

In Fig. 4 the same gears were used to study the effect of viscosity 
on the oil-film thickness at a constant load of 2000 lb per in. 
Several different oils were used, and the viscosities were cal- 
culated from the temperature measured by the thermometer 
placed near the mesh. 

These pictures are self-explanatory. The greatest difference 
is the change in the oil-film thickness at the tips and roots on the 
one hand, and the pitch line on the other. Even with the thick- 
est oil there is evidence of slight contact at the extreme ends of 
the contact. With the very thin oils, where there is virtually 
zero oil film over the entire contact path, the oil film builds up at 
the pitch line where there is pure rolling and no sliding. 

In Fig. 5 a series of photographs was taken with the same loads 
and type of oil as were used in Fig. 3. In this case the 30/30 
gears were used with large tip relief (0.001 in.) as opposed to the 
19/40 gears which had only a small tip relief (0.0003 in.) as de- 
tailed in Fig. 2. The comparison of the two photographs shows 
that the build-up of the oil film is quite different, there being very 
little break through at the ends of the line of contact. It is 
thought that this technique should make it possible to study the 
effect of tip relief on gears under service conditions. It must be 
emphasized that this is only a very preliminary study of the 
various factors involved, and further work on them is clearly 
desirable. 

Finally, the various results of all these tests are plotted together 
in Figs. 6 and 7. The maximum and minimum oil-film thicknesses 
are shown. In Fig. 6 the effect of load on oil-film 
thickness at the pitch line is plotted for a given viscosity. In 
Fig. 7 the effect of viscosity on oil-film thickness, also at the 
pitch line, is displayed for a constant load of 2000 lb per in. The 
extremely small variation with viscosity is most interesting, 
though of course the oil-film thickness at the pitch line is more of 
theoretical than practical interest. The difficulty of measuring 
the true viscosity may have influenced the slope of this line. The 
effect of load is larger than expected; this will be discussed later. 

Fig. 6 Collected results. Film thickness at pitch line against load (constant 
viscosity 119 c-s). Theoretical and experimental. 

Fig. 7 Collected results. Film thickness at pitch line against viscosity 
(constant load 2000 lb/in.). Theoretical and experimental. 

The theoretical values of Grubin and Vinogradova [8] and 
Petrusevich [9] also are plotted on these graphs. The parame- 
ters needed to evaluate these formulas are discussed in Ap- 
pendix 2. 


Discussion of Results 


Figs. 6 and 7 showing the effect of load and viscosity on film 
thickness indicate that at the pitch line the oil-film thickness 
only varies between 1 and 4 × 10⁻⁴ in., for a variation of viscosity 
of a thousandfold. This corresponds to the thickness varying 
with the 0.15th-power of the viscosity. This is very much smaller 
than the theoretical work drawn on the same diagram indicates. 
There are, of course, many factors which have not been taken 
into account in the derivation of these equations, apart from 
their own intrinsic approximations. One unknown factor is 
probably the relaxation time of the oil, as is also the adiabatic 
and frictional heating of the oil as it passes through the con- 
tact zone, and the variation of viscosity across the oil film it- 
self. These are all formidable problems when considered on 
their own without the complications found in gear lubrication. 
Work is in progress at the moment in this laboratory on the 
effect of relaxation time, and the variation of viscosity across 
the oil film is the subject of a separate paper by Cameron [14]. 
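
The weak dependence on viscosity discussed above can be put in numbers with a two-point power-law estimate. This is only a rough sketch with a hypothetical function name: it uses the extreme values quoted in the text (thickness from 1 to 4 × 10⁻⁴ in. over a thousandfold change of viscosity), whereas the 0.15 power quoted by the authors presumably comes from the slope of the plotted line rather than from the extremes.

```python
import math


def power_law_exponent(h_low: float, h_high: float, viscosity_ratio: float) -> float:
    """Exponent n in h ~ viscosity**n implied by two end points."""
    return math.log(h_high / h_low) / math.log(viscosity_ratio)


def thickness_ratio(viscosity_ratio: float, exponent: float) -> float:
    """Change in film thickness implied by a viscosity ratio and an exponent."""
    return viscosity_ratio ** exponent


if __name__ == "__main__":
    n = power_law_exponent(1.0e-4, 4.0e-4, 1000.0)
    print(f"end-point exponent: {n:.3f}")
    print(f"thickness ratio for n = 0.15: {thickness_ratio(1000.0, 0.15):.2f}")
```

Either way the exponent is well below the 0.727 that appears on the viscosity term of the Grubin and Vinogradova equation in Appendix 2.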

It is satisfactory that these results in Fig. 6 agree closely with 
the excellent work of Crook [15] who used a capacity method for 
measuring the oil-film thickness between rolling disks. This 

correlation confirms the reliability of the simple method de- 
scribed here. The fact that a change in load has a larger effect 
on the oil-film thickness than viscosity is surprising as the vis- 
cosity term comes into the equations twice, once in its own right 
and once contained in the pressure-viscosity coefficient which 
has been discussed by Wooster [16], whose correlation is used to 
evaluate the theoretical equations. These equations are for 
convenience stated in the Appendix as their sources are probably 
not readily available to the majority of readers. 
The large diminution of oil-film thickness at the ends of 
the contact line, where the load is nominally half its value at the 
pitch line (being shared by two teeth) is quite unexpected. The 
flash temperature naturally is larger at the tips where there is 
maximum sliding, but most studies (Lane and Hughes [17]) show 
that it is only of the order of 20 to 30 deg C. The change of 
viscosity consequent on this increase of temperature is not nearly 
large enough to explain this very big drop in film thickness in 
view of the insensitivity of film thickness to viscosity shown in 
Fig. 7. It may be that there is a much larger impact load than is 
normally suspected, though here again the results shown in 
Fig. 6 indicate that the oil-film thickness is relatively insensitive 
to changes in load at high loadings. The work carried out by 
Harris [18] would not lead one to expect an impact load greater 
than 2 or 3 times nominal. 

The difference in the diagrams with and without large tip 
relief, Figs. 3 and 5, would indicate that the low oil-film thickness 
at the tips without sufficient tip relief, could be due to edge con- 
tact. This would mean that the sharp edge of the tip of the tooth 
digs into the opposing surface and the oil film cannot be estab- 
lished immediately for a finite time corresponding to an apprecia- 
ble distance down the tooth flank. Some unpublished results 
from the 4-ball machine work in this laboratory show that the 
high-pressure oil film separating two curved surfaces is not al- 
ways formed immediately. 


Conclusion 


The severity of the conditions at the tips found in these ex- 
periments may help to explain why disk machines of the SAE, 
Timken, or similar types, do not always give good correlation 
with gears. In disk machines the moving members are under a 
steady load and do not have anything comparable with the sharp 
edge of the tip of the teeth. 

It is seen, therefore, that this method of measuring oil-film 
thickness has demonstrated the existence of an oil film of con- 
siderable thickness at the pitch line. It is approximately of the 
same order of magnitude as that predicted by theoretical analysis. 

These experiments also show how very much the conditions of 
lubrication at the ends of the contact line can be altered by tip 
relief, and it is very simple to study it with this apparatus. 

APPENDIX 1 


The oils used were straight undoped mineral oils of the following 
physical properties: 


Oil no.                               1       3       4       5 
Specific gravity                      0.863   0.883   0.9025  0.907 
Closed flash, deg F                   +415    +505    +535    +407 
Cloud point, deg F                    22 18 52 
Pour point, deg F                     15      5       5       0 
Kinematic viscosity, centistokes: 
  at 100 F                            31.8    119.5   528.2   197.7 
  at 210 F                            5.35    3.27    Ba      14.29 
Viscosity index                       113     101     98      69 
Colour, Tag Robinson                  17 1/2  17      2'/,    9 
Ramsbottom carbon, per cent           0.06    0.12    0.56    0.15 
Neut. No., mg KOH/g                   -0.05 -0.05 0.12 


APPENDIX 2 


The equation developed by Petrusevich [9] for the oil film 
thickness between rotating cylinders is: 


(mol 


13 


Amin 


where 


$\eta_0$ = viscosity at atmospheric pressure in kg-sec/cm$^2$ 
$(U_1 + U_2)$ = total rolling velocity of disks 1 and 2 in cm/sec 
$R$ = reduced radius, cm, defined as $1/R = 1/R_1 + 1/R_2$ 
$p_{max}$ = peak Hertzian (elastic) pressure at the contact in kg/sq cm 
$\alpha$ = pressure-viscosity coefficient in (kg/sq cm)$^{-1}$, defined 
from the standard exponential equation 
$\eta_p = \eta_0 \exp(\alpha p)$ ... see hereafter 


This equation is a generalization from 3 computed values and it 
is only valid for Pmax above 10,000 kg/sq cm. We have found 
it convenient to compute in a transposed form: 


169 R 
Petrusevich states he has taken into account the compressibility 
of the oil as well as the temperature of the oil under sliding and 
its thermal conductivity. No details of its derivation or computa- 


tion are available. 
The Grubin and Vinogradova equation [8] is: 


$\ldots\,(\eta_0 U \alpha)^{0.727}\, R^{0.364}\,\ldots$ 


in addition to the previously defined symbols: 

$P$ = load/unit length 
$\sigma$ = Poisson’s ratio 
$E$ = Young’s modulus 


This formula is dimensionally consistent so any homogeneous 
system of units can be used. 
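
For readers without access to the source, the sketch below evaluates the form of Grubin's formula commonly quoted in lubrication textbooks, h/R = 1.95 (alpha eta0 u / R)^(8/11) (E' R / w)^(1/11). This is an assumption about the form, not a transcription of the equation printed above: the constant 1.95, the use of the mean rolling speed u = (U1 + U2)/2, and the reduced modulus E' are taken from the textbook version and may differ from the authors' convention. The exponents 8/11 and 4/11 do agree with the 0.727 and 0.364 that survive in the printed equation. All function names are hypothetical.

```python
def reduced_modulus(e1: float, nu1: float, e2: float, nu2: float) -> float:
    """E' defined by 2/E' = (1 - nu1^2)/E1 + (1 - nu2^2)/E2 (textbook convention)."""
    return 2.0 / ((1.0 - nu1 ** 2) / e1 + (1.0 - nu2 ** 2) / e2)


def grubin_film_thickness(eta0: float, u_mean: float, alpha: float,
                          radius: float, e_reduced: float, load_per_length: float) -> float:
    """Film thickness from the textbook Grubin formula.

    Any homogeneous system of units may be used, e.g. SI:
    eta0 [Pa s], u_mean [m/s], alpha [1/Pa], radius [m],
    e_reduced [Pa], load_per_length [N/m] -> thickness [m].
    """
    return (1.95
            * (alpha * eta0 * u_mean) ** (8.0 / 11.0)
            * radius ** (4.0 / 11.0)
            * e_reduced ** (1.0 / 11.0)
            / load_per_length ** (1.0 / 11.0))


if __name__ == "__main__":
    # Illustrative input values only; they are not taken from the paper.
    e_red = reduced_modulus(2.07e11, 0.3, 2.07e11, 0.3)
    h = grubin_film_thickness(eta0=0.1, u_mean=5.0, alpha=2.0e-8,
                              radius=0.01, e_reduced=e_red, load_per_length=3.5e5)
    print(f"film thickness: {h:.3e} m")
```

The 1/11 power on load shows why this class of theory predicts a film that is almost independent of load, in contrast to the stronger load effect reported in Fig. 6.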

The correlation we use for the pressure-viscosity coefficient is 
due to Wooster [16] and is 

$\alpha = (0.6 + \log_{10} \eta_0) \times 10^{-3}$ 

with $\alpha$ in (kg/sq cm)$^{-1}$ and $\eta_0$ in centipoises. 
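
A minimal numerical sketch of this correlation follows. It assumes the garbled multiplier in the printed equation reads 10^-3, which is consistent with the stated units of (kg/sq cm)^-1, and it applies the exponential pressure-viscosity law defined earlier in this Appendix. The function names are hypothetical.

```python
import math


def wooster_alpha(eta0_centipoise: float) -> float:
    """Pressure-viscosity coefficient in (kg/sq cm)^-1 from viscosity in centipoises.

    Assumed reading of the correlation: alpha = (0.6 + log10(eta0)) * 1e-3.
    """
    return (0.6 + math.log10(eta0_centipoise)) * 1.0e-3


def viscosity_at_pressure(eta0: float, alpha: float, pressure: float) -> float:
    """Exponential law eta_p = eta0 * exp(alpha * p); alpha and p in reciprocal units."""
    return eta0 * math.exp(alpha * pressure)


if __name__ == "__main__":
    for eta0 in (10.0, 100.0, 1000.0):
        a = wooster_alpha(eta0)
        ratio = viscosity_at_pressure(1.0, a, 1000.0)
        print(f"eta0 = {eta0:6.0f} cP  alpha = {a:.2e} per kg/sq cm  "
              f"viscosity ratio at 1000 kg/sq cm = {ratio:.1f}")
```

Because the coefficient itself grows with the logarithm of viscosity, viscosity enters the film-thickness equations twice, as noted in the Discussion of Results.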


DISCUSSION 
E. K. Gatcombe³ 


The main point of discussion raised here is whether or not the 
oil film changes thickness at the pitch point phase of mating. 

From information gained through research I have concluded 
that these hydrodynamic films can change thickness very sud- 
denly, both in the way of a sudden decrease in thickness as well as 
a possible sudden increase in thickness if proper conditions exist. 

We can imagine that the film is of a certain thickness at the 
first phase of contact, where the velocity of sliding is relatively 
high and the loads relatively light. Then, as contact progresses 
towards the pitch point phase, the velocity of sliding decreases to 
zero and the loads are relatively high. 

Now the high velocity of sliding might cause the bulk viscosity 
of the oil to be decreased and thus the magnitude of the oil film 
thickness might be reduced. But we must remember that a high 
relative velocity, in general, is proper for the build-up of relatively 
thick hydrodynamic films, except where the associated decrease 
in viscosity mentioned above causes the films to be reduced 
in thickness. 

Then, on the assumption that hydrodynamic films exist, I 
would expect that there could be a decrease in the film thickness 
right at the pitch point phase, as is indicated by your oscillograph 
traces. 


3 Professor, Mechanical Engineering Department, U. S. Naval 
Postgraduate School, Monterey, Calif. Mem. ASME. 
But if we assume that plastic films result from these extremely 
high contact pressures, then they probably change in thickness 
only very slightly over the entire tooth engagement cycle. 


Authors’ Closure 


Dr. Gatcombe has raised some question about the relative mag- 
nitude of the oil film thickness at the pitch line, suggesting that 
there might be a drop in its thickness at the pitch line during sin- 
gle pair action under conditions of pure rolling. We find, how- 
ever, that for heavily loaded contacts pure rolling can be expected 
to be a more favorable situation for oil film formation. In spite 
of the fact that two pairs of teeth are sharing the load at roots and 
tips, it is the high impact coupled with a high sliding action which 
tends to reduce the oil film at this point. For all 20-deg involute 
tooth forms the combined deflection of a pair of teeth is found to 
be roughly 0.001 in. per 2000 lb/in. load.⁴ The results 
suggest that, as soon as the combined deflection of the outgoing 
teeth in mesh exceeds the tip relief provided, serious interference 
results at the root and tip of the incoming pair. Referring to 
Fig. 3, the oil film at root and tip rapidly deteriorates from ap- 
proximately a 1000 lb/in. load and upward. When tip and root 
relief (Fig. 2) are considered together, the over-all relief on the 
teeth amounts to approximately 0.0005 in., which just about ap- 
proximates the combined deflection of the teeth for this load. 
Dr. Gatcombe's suggestion of some sort of "squeeze action" 
during conditions of pure rolling needs study, and it is the 
authors' opinion that more attention will have to be given in the 
future to the theory to differentiate between conditions of pure 
rolling and pure sliding, even when the temperature effects are 
taken into consideration in the latter case. 

In regard to Dr. Felan’s remark during the discussion about 
extreme pressure additives, so far we have only tested straight 
mineral oils. The authors suspect, however, that the use of an 
additive oil will not appreciably change the traces shown. In 
other words, the E.P. additives are operable in the region of 
so-called zero oil film. Thus the technique used only measures the 
thickness of a hydrodynamic film and does not offer a satisfactory 
calibration for surface films formed under the action of heat and 
pressure. 

It is true that, with the measuring technique used, it is not pos- 
sible to determine the relative disposition of the oil film thickness 
during two pair action. The oscillograph traces do show the 
minimum of the two oil films present. In regard to his sugges- 
tion of using a contact ratio of 1, this has not been tried. The 
difficulty, of course, with such a small contact ratio is that it re- 
sults in rough action of the gears causing higher impact loads 
which might nullify any results which might be obtained. We 
have considered the use of a single tooth which is insulated from 
the rest of the gear but the reader can appreciate that this would 
be difficult to do and still simulate the elastic characteristics of the 
remainder of the teeth on the gear. 

Dr. Poritsky has asked for some clarification of the reasons for 
scatter in the oscillograph traces shown. Some idea of the rela- 
tive influences of the various factors which govern the nature of 
the picture may be obtained from the list of sizes given here: 


Surface roughnesses .............. 16 to 25 microinches 
Dirt particles ................... 50 to 100 microinches 
Oil film thickness ............... 50 to 250 microinches 
    (net, minus thickness of dirt particles) 


4 H. Walker, "Gear Tooth Deflection and Profile Modifications," 
The Engineer, vol. 166, p. 409. 
A New Approach to Linear Filtering 
and Prediction Problems 


R. E. KALMAN 


Research Institute for Advanced Study,² 
Baltimore, Md. 


The classical filtering and prediction problem is re-examined using the 
Bode-Shannon representation of random processes and the "state-transition" 
method of analysis of dynamic systems. New results are: 

(1) The formulation and methods of solution of the problem apply without 
modification to stationary and nonstationary statistics and to 
growing-memory and infinite-memory filters. 

(2) A nonlinear difference (or differential) equation is derived for the 
covariance matrix of the optimal estimation error. From the solution of 
this equation the coefficients of the difference (or differential) equation 
of the optimal linear filter are obtained without further calculations. 

(3) The filtering problem is shown to be the dual of the noise-free 
regulator problem. 

The new method developed here is applied to two well-known problems, 
confirming and extending earlier results. 

The discussion is largely self-contained and proceeds from first 
principles; basic concepts of the theory of random processes are reviewed 
in the Appendix. 


Introduction 


AN IMPORTANT class of theoretical and practical 
problems in communication and control is of a statistical nature. 
Such problems are: (i) Prediction of random signals; (ii) separa- 
tion of random signals from random noise; (iii) detection of signals 
of known form (pulses, sinusoids) in the presence of random noise. 

In his pioneering work, Wiener [1]³ showed that problems (i) 
and (ii) lead to the so-called Wiener-Hopf integral equation; he 
also gave a method (spectral factorization) for the solution of 
this integral equation in the practically important special case of 
stationary statistics and rational spectra. 

Many extensions and generalizations followed Wiener’s basic 
work. Zadeh and Ragazzini solved the finite-memory case [2]. 
Concurrently and independently of Bode and Shannon [3], they 
also gave a simplified method [2] of solution. Booton dis- 
cussed the nonstationary Wiener-Hopf equation [4]. These 
results are now in standard texts [5-6]. A somewhat different 
approach along these main lines has been given recently by 
Darlington [7]. For extensions to sampled signals, see, e.g., 
Franklin [8], Lees [9]. Another approach based on the eigen- 
functions of the Wiener-Hopf equation (which applies also to 
nonstationary problems whereas the preceding methods in general 
don’t), has been pioneered by Davis [10] and applied by many 
others, e.g., Shinbrot [11], Blum [12], Pugachev [13], Solodov- 
nikov [14]. 

In all these works, the objective is to obtain the specification of 
a linear dynamic system (Wiener filter) which accomplishes the 
prediction, separation, or detection of a random signal.⁴ 


1 This research was supported in part by the U. S. Air Force Office 
of Scientific Research under Contract AF 49(638)-382. 

2 7212 Bellona Ave. 

3 Numbers in brackets designate References at end of paper. 

4 Of course, in general these tasks may be done better by nonlinear 
filters. At present, however, little or nothing is known about how to 
obtain (both theoretically and practically) these nonlinear filters. 

Contributed by the Instruments and Regulators Division and 
presented at the Instruments and Regulators Conference, March 29- 
April 2, 1959, of The American Society of Mechanical Engineers. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, February 
24, 1959. Paper No. 59-IRD-11. 
Present methods for solving the Wiener problem are subject to 
a number of limitations which seriously curtail their practical 
usefulness: 


(1) The optimal filter is specified by its impulse response. It 
is not a simple task to synthesize the filter from such data. 

(2) Numerical determination of the optimal impulse response 
is often quite involved and poorly suited to machine computa- 
tion. The situation gets rapidly worse with increasing complex- 
ity of the problem. 

(3) Important generalizations (e.g., growing-memory filters, 
nonstationary prediction) require new derivations, frequently 
of considerable difficulty to the nonspecialist. 

(4) The mathematics of the derivations are not transparent. 
Fundamental assumptions and their consequences tend to be 
obscured. 

This paper introduces a new look at this whole assemblage of 
problems, sidestepping the difficulties just mentioned. The 
following are the highlights of the paper: 

(5) Optimal Estimates and Orthogonal Projections. The 
Wiener problem is approached from the point of view of condi- 
tional distributions and expectations. In this way, basic facts of 
the Wiener theory are quickly obtained; the scope of the results 
and the fundamental assumptions appear clearly. It is seen that 
all statistical calculations and results are based on first and second 
order averages; no other statistical data are needed. Thus dif- 
ficulty (4) is eliminated. This method is well known in probabil- 
ity theory (see pp. 75-78 and 148-155 of Doob [15] and pp. 455- 
464 of Loéve [16]) but has not yet been used extensively in en- 
gineering. 

(6) Models for Random Processes. Following, in particular, 
Bode and Shannon [3], arbitrary random signals are represented 
(up to second order average statistical properties) as the output 
of a linear dynamic system excited by independent or uncorre- 
lated random signals (‘‘white noise’). This is a standard trick 
in the engineering applications of the Wiener theory [2-7]. The 
approach taken here differs from the conventional one only in the 
way in which linear dynamic systems are described. We shall 
emphasize the concepts of state and state transition; in other 
words, linear systems will be specified by systems of first-order 
difference (or differential) equations. This point of view is 
natural and also necessary in order to take advantage of the 
simplifications mentioned under (5). 

(7) Solution of the Wiener Problem. With the state-transition 
method, a single derivation covers a large variety of problems: 
growing and infinite memory filters, stationary and nonstationary 
statistics, etc.; difficulty (3) disappears. Having guessed the 
“state” of the estimation (i.e., filtering or prediction) problem 
correctly, one is led to a nonlinear difference (or differential) 
equation for the covariance matrix of the optimal estimation 
error. This is vaguely analogous to the Wiener-Hopf equation. 
Solution of the equation for the covariance matrix starts at the 
time $t_0$ when the first observation is taken; at each later time $t$ 
the solution of the equation represents the covariance of the 
optimal prediction error given observations in the interval $(t_0, t)$. 
From the covariance matrix at time $t$ we obtain at once, without 
further calculations, the coefficients (in general, time-varying) 
characterizing the optimal linear filter. 

(8) The Dual Problem. The new formulation of the Wiener 
problem brings it into contact with the growing new theory of 
control systems based on the “‘state’’ point of view [17-24]. It 
turns out, surprisingly, that the Wiener problem is the dual of 
the noise-free optimal regulator problem, which has been solved 
previously by the author, using the state-transition method to 
great advantage [18, 23, 24]. The mathematical background of 
the two problems is identical; this has been suspected all along, 
but until now the analogies have never been made explicit. 

(9) Applications. The power of the new method is most ap- 
parent in theoretical investigations and in numerical answers to 
complex practical problems. In the latter case, it is best to resort 
to machine computation. Examples of this type will be discussed 
later. To provide some feel for applications, two standard 
examples from nonstationary prediction are included; in these 
cases the solution of the nonlinear difference equation mentioned 
under (7) above can be obtained even in closed form. 


For easy reference, the main results are displayed in the form of 
theorems. Only Theorems 3 and 4 are original. The next sec- 
tion and the Appendix serve mainly to review well-known ma- 
terial in a form suitable for the present purposes. 


Notation Conventions 


Throughout the paper, we shall deal mainly with discrete (or 
sampled) dynamic systems; in other words, signals will be ob- 
served at equally spaced points in time (sampling instants). By 
suitable choice of the time scale, the constant intervals between 
successive sampling instants (sampling periods) may be chosen as 
unity. Thus variables referring to time, such as $t$, $t_0$, $\tau$, $T$ will 
always be integers. The restriction to discrete dynamic systems 
is not at all essential (at least from the engineering point of view); 
by using the discreteness, however, we can keep the mathematics 
rigorous and yet elementary. Vectors will be denoted by 
small bold-face letters: a, b, ..., u, x, y, ... A vector or more 
precisely an n-vector is a set of n numbers $x_1, \ldots, x_n$; the $x_i$ 
are the co-ordinates or components of the vector x. 

Matrices will be denoted by capital bold-face letters: A, B, Q, 
$\Phi$, ...; they are m × n arrays of elements $a_{ij}$, $b_{ij}$, $q_{ij}$, ... 
The transpose (interchanging rows and columns) of a matrix will 
be denoted by the prime. In manipulating formulas, it will be 
convenient to regard a vector as a matrix with a single column. 

Using the conventional definition of matrix multiplication, we 
write the scalar product of two n-vectors x, y as 

$$\mathbf{x}'\mathbf{y} = \sum_{i=1}^{n} x_i y_i = \mathbf{y}'\mathbf{x}$$

The scalar product is clearly a scalar, i.e., not a vector, quantity. 
Similarly, the quadratic form associated with the n × n matrix Q 
is 

$$\mathbf{x}'\mathbf{Q}\mathbf{x} = \sum_{i,j=1}^{n} x_i q_{ij} x_j$$

We define the expression xy' where x is an m-vector and y is an 
n-vector to be the m × n matrix with elements $x_i y_j$. 

We write E(x) = Ex for the expected value of the random vector 
x (see Appendix). It is usually convenient to omit the brackets 
after E. This does not result in confusion in simple cases since 
constants and the operator E commute. Thus Exy' = matrix 
with elements $E(x_i y_j)$; ExEy' = matrix with elements $E(x_i)E(y_j)$. 
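
The notation conventions above can be illustrated with a short numerical sketch. The vectors and the matrix below are arbitrary values chosen only for illustration (they do not come from the paper), and NumPy is assumed to be available.

```python
import numpy as np

# Arbitrary illustrative values (hypothetical, not from the paper).
x = np.array([1.0, 2.0, 3.0])          # a 3-vector
y = np.array([4.0, 5.0, 6.0])          # another 3-vector
Q = np.array([[2.0, 0.0, 1.0],
              [0.0, 1.0, 0.0],
              [1.0, 0.0, 3.0]])        # a 3 x 3 matrix

# Scalar product x'y = sum_i x_i y_i (equal to y'x).
scalar_product = float(x @ y)

# Quadratic form x'Qx = sum_{i,j} x_i q_ij x_j.
quadratic_form = float(x @ Q @ x)

# The expression xy' for a 3-vector x and a 2-vector is a 3 x 2 matrix
# with elements x_i y_j.
outer = np.outer(x, y[:2])

print(scalar_product, quadratic_form, outer.shape)
```

Treating a vector as a single-column matrix, as the text suggests, gives the same results with explicit transposes; the one-dimensional arrays used here are only a convenience.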

For ease of reference, a list of the principal symbols used is 
given below. 


Optimal Estimates 

$t$                  time in general; present time. 
$t_0$                time at which observations start. 
$x_1(t), x_2(t)$     basic random variables. 
$y(t)$               observed random variable. 
$x_1^*(t_1|t)$       optimal estimate of $x_1(t_1)$ given $y(t_0), \ldots, y(t)$. 
$L$                  loss function (nonrandom function of its argument). 
$\epsilon$           estimation error (random variable). 


Orthogonal Projections 

$Y(t)$               linear manifold generated by the random variables $y(t_0), \ldots, y(t)$. 
$\bar{x}(t_1|t)$     orthogonal projection of $x(t_1)$ on $Y(t)$. 
$\tilde{x}(t_1|t)$   component of $x(t_1)$ orthogonal to $Y(t)$. 


Models for Random Processes 

$\Phi(t + 1; t)$     transition matrix. 
$Q(t)$               covariance of random excitation. 


Solution of the Wiener Problem 

$\mathbf{x}(t)$              basic random variable. 
$\mathbf{y}(t)$              observed random variable. 
$Y(t)$                       linear manifold generated by $\mathbf{y}(t_0), \ldots, \mathbf{y}(t)$. 
$Z(t)$                       linear manifold generated by $\tilde{\mathbf{y}}(t|t - 1)$. 
$\mathbf{x}^*(t_1|t)$        optimal estimate of $\mathbf{x}(t_1)$ given $Y(t)$. 
$\tilde{\mathbf{x}}(t_1|t)$  error in optimal estimate of $\mathbf{x}(t_1)$ given $Y(t)$. 


Optimal Estimates 


To have a concrete description of the type of problems to be 
studied, consider the following situation. We are given signal 
$x_1(t)$ and noise $x_2(t)$. Only the sum $y(t) = x_1(t) + x_2(t)$ can be 
observed. Suppose we have observed and know exactly the 
values of $y(t_0), \ldots, y(t)$. What can we infer from this knowledge 
in regard to the (unobservable) value of the signal at $t = t_1$, where 
$t_1$ may be less than, equal to, or greater than $t$? If $t_1 < t$, this is a 
data-smoothing (interpolation) problem. If $t_1 = t$, this is called 
filtering. If $t_1 > t$, we have a prediction problem. Since our 
treatment will be general enough to include these and similar problems, 
we shall use hereafter the collective term estimation. 

As was pointed out by Wiener [1], the natural setting of the 
estimation problem belongs to the realm of probability theory 
and statistics. Thus signal, noise, and their sum will be random 
variables, and consequently they may be regarded as random 
processes. From the probabilistic description of the random 
processes we can determine the probability with which a 
particular sample of the signal and noise will occur. For any given 
set of measured values $\eta(t_0), \ldots, \eta(t)$ of the random variable $y(t)$ 
one can then also determine, in principle, the probability of 
simultaneous occurrence of various values $\xi_1$ of the random variable 
$x_1(t_1)$. This is the conditional probability distribution function 

$$\Pr[x_1(t_1) \le \xi_1 \mid y(t_0) = \eta(t_0), \ldots, y(t) = \eta(t)] = F(\xi_1) \qquad (1)$$


Evidently, $F(\xi_1)$ represents all the information which the 
measurement of the random variables $y(t_0), \ldots, y(t)$ has conveyed 
about the random variable $x_1(t_1)$. Any statistical estimate of the 
random variable $x_1(t_1)$ will be some function of this distribution 
and therefore a (nonrandom) function of the random variables 
$y(t_0), \ldots, y(t)$. This statistical estimate is denoted by $X_1(t_1|t)$, 
or by just $X_1(t_1)$ or $X_1$ when the set of observed random variables 
or the time at which the estimate is required are clear from 
context. 

Suppose now that $X_1$ is given as a fixed function of the random 
variables $y(t_0), \ldots, y(t)$. Then $X_1$ is itself a random variable and 
its actual value is known whenever the actual values of $y(t_0), \ldots, y(t)$ 
are known. In general, the actual value of $X_1(t_1)$ will be 
different from the (unknown) actual value of $x_1(t_1)$. To arrive at 
a rational way of determining $X_1$, it is natural to assign a penalty 
or loss for incorrect estimates. Clearly, the loss should be a (i) 
positive, (ii) nondecreasing function of the estimation error 
$\epsilon = x_1(t_1) - X_1(t_1)$. Thus we define a loss function by 

$$\begin{aligned}
L(0) &= 0 \\
L(\epsilon_2) &\ge L(\epsilon_1) \ge 0 \quad \text{when } \epsilon_2 \ge \epsilon_1 \ge 0 \\
L(\epsilon) &= L(-\epsilon)
\end{aligned} \qquad (2)$$


Some common examples of loss functions are: $L(\epsilon) = a\epsilon^2$, $a\epsilon^4$, 
$a|\epsilon|$, $a[1 - \exp(-\epsilon^2)]$, etc., where $a$ is a positive constant. 
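
As a rough numerical check, the sketch below evaluates the four example loss functions on a grid and tests the three conditions of (2). The constant `a = 2.0`, the grid, and the helper name `satisfies_conditions` are assumptions made for illustration; a grid test of this kind is a sanity check, not a proof.

```python
import numpy as np

a = 2.0  # hypothetical positive constant

# The example loss functions named in the text.
loss_functions = {
    "a*eps^2": lambda e: a * e**2,
    "a*eps^4": lambda e: a * e**4,
    "a*|eps|": lambda e: a * np.abs(e),
    "a*(1-exp(-eps^2))": lambda e: a * (1.0 - np.exp(-e**2)),
}

def satisfies_conditions(loss, grid=np.linspace(0.0, 5.0, 501)):
    """Check the conditions in (2) on a finite grid of nonnegative errors."""
    values = loss(grid)
    zero_at_origin = bool(loss(0.0) == 0.0)                 # L(0) = 0
    nondecreasing = bool(np.all(np.diff(values) >= 0.0))    # monotone for eps >= 0
    symmetric = bool(np.allclose(values, loss(-grid)))      # L(eps) = L(-eps)
    return zero_at_origin and nondecreasing and symmetric

results = {name: satisfies_conditions(f) for name, f in loss_functions.items()}
print(results)

# A function that is not a loss function in the sense of (2): it is not symmetric.
print(satisfies_conditions(lambda e: e))
```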

One (but by no means the only) natural way of choosing the 
random variable $X_1$ is to require that this choice should minimize 
the average loss or risk 

$$E\,L[x_1(t_1) - X_1(t_1)] = E\big[E\{L[x_1(t_1) - X_1(t_1)] \mid y(t_0), \ldots, y(t)\}\big] \qquad (3)$$

Since the first expectation on the right-hand side of (3) does not 
depend on the choice of $X_1$ but only on $y(t_0), \ldots, y(t)$, it is clear 
that minimizing (3) is equivalent to minimizing 

$$E\{L[x_1(t_1) - X_1(t_1)] \mid y(t_0), \ldots, y(t)\} \qquad (4)$$


Under just slight additional assumptions, optimal estimates can 
be characterized in a simple way. 

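Theorem 1. Assume that L is of type (2) and that the conditional 
distribution function $F(\xi)$ defined by (1) is: 

(A) symmetric about the mean $\bar{\xi}$: 

$$F(\xi - \bar{\xi}) = 1 - F(\bar{\xi} - \xi)$$

(B) convex for $\xi \le \bar{\xi}$: 

$$F(\lambda\xi_1 + (1 - \lambda)\xi_2) \le \lambda F(\xi_1) + (1 - \lambda)F(\xi_2)$$

for all $\xi_1, \xi_2 \le \bar{\xi}$ and $0 \le \lambda \le 1$. 

Then the random variable $x_1^*(t_1|t)$ which minimizes the average loss 
(3) is the conditional expectation 

$$x_1^*(t_1|t) = E[x_1(t_1) \mid y(t_0), \ldots, y(t)] \qquad (5)$$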


Proof: As pointed out recently by Sherman [25], this theorem 
follows immediately from a well-known lemma in probability 
theory. 

Corollary. If the random processes $\{x_1(t)\}$, $\{x_2(t)\}$, and $\{y(t)\}$ 
are gaussian, Theorem 1 holds. 

Proof: By Theorem 5, (A) (see Appendix), conditional distributions 
on a gaussian random process are gaussian. Hence the 
requirements of Theorem 1 are always satisfied. 
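
Theorem 1 and its corollary can be illustrated by a small Monte Carlo sketch. It assumes, purely for illustration, that the conditional distribution of the signal given the observations is gaussian with mean 1.5 and unit variance; the sample size, the candidate grid, and the helper name `best_estimate` are likewise hypothetical. For each example loss function the sample risk is evaluated over the grid of candidate estimates, and the minimizer is expected to land near the conditional mean.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical conditional distribution of x1(t1) given the observations:
# gaussian with mean 1.5 and unit variance (symmetric and unimodal).
conditional_mean = 1.5
samples = rng.normal(loc=conditional_mean, scale=1.0, size=100_000)

# Example loss functions of type (2), with a = 1.
losses = {
    "eps^2": lambda e: e**2,
    "eps^4": lambda e: e**4,
    "|eps|": lambda e: np.abs(e),
    "1-exp(-eps^2)": lambda e: 1.0 - np.exp(-e**2),
}

# Candidate values for the estimate X1.
candidates = np.linspace(-1.0, 4.0, 101)

def best_estimate(loss):
    """Return the candidate that minimizes the sample average of the loss."""
    risks = [loss(samples - c).mean() for c in candidates]
    return float(candidates[int(np.argmin(risks))])

minimizers = {name: best_estimate(loss) for name, loss in losses.items()}
print(minimizers)
```

The agreement is only as fine as the candidate grid and the sampling error allow; the sketch checks the claim numerically for one distribution and does not replace the proof.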

In the control systems literature, this theorem appears some- 
times in a form which is more restrictive in one way and more 
general in another way: 
Theorem 1-a. If $L(\epsilon) = \epsilon^2$, then Theorem 1 is true without 
assumptions (A) and (B). 

Proof: Expand the conditional expectation (4): 

$$E[x_1^2(t_1) \mid y(t_0), \ldots, y(t)] - 2X_1(t_1)\,E[x_1(t_1) \mid y(t_0), \ldots, y(t)] + X_1^2(t_1)$$

and differentiate with respect to $X_1(t_1)$. This is not a completely 
rigorous argument; for a simple rigorous proof see Doob [15], pp. 
77-78. 

Remarks: (a) As far as the author is aware, it is not known 
what is the most general class of random processes $\{x_1(t)\}$, $\{x_2(t)\}$ 
for which the conditional distribution function satisfies the 
requirements of Theorem 1. 

(b) Aside from the note of Sherman, Theorem 1 apparently 
has never been stated explicitly in the control systems literature. 
In fact, one finds many statements to the effect that loss functions 
of the general type (2) cannot be conveniently handled mathe- 
matically. 

(c) In the sequel, we shall be dealing mainly with vector-valued 
random variables. In that case, the estimation problem is 
stated as: Given a vector-valued random process $\{\mathbf{x}(t)\}$ and 
observed random variables $\mathbf{y}(t_0), \ldots, \mathbf{y}(t)$, where $\mathbf{y}(t) = \mathbf{M}\mathbf{x}(t)$ ($\mathbf{M}$ 
being a singular matrix; in other words, not all co-ordinates of 
$\mathbf{x}(t)$ can be observed), find an estimate $\mathbf{X}(t_1)$ which minimizes the 
expected loss $E[L(\|\mathbf{x}(t_1) - \mathbf{X}(t_1)\|)]$, $\|\ \|$ being the norm of a 
vector. 

Theorem 1 remains true in the vector case also, provided we 
require that the conditional distribution function of the n 
co-ordinates of the vector $\mathbf{x}(t_1)$, 

$$\Pr[x_1(t_1) \le \xi_1, \ldots, x_n(t_1) \le \xi_n \mid \mathbf{y}(t_0), \ldots, \mathbf{y}(t)] = F(\xi_1, \ldots, \xi_n)$$

be symmetric with respect to the n variables $\xi_1 - \bar{\xi}_1, \ldots, \xi_n - \bar{\xi}_n$ 
and convex in the region where all of these variables are negative. 


Orthogonal Projections 


The explicit calculation of the optimal estimate as a function 
of the observed variables is, in general, impossible. There is an 
important exception: The processes $\{x_1(t)\}$, $\{x_2(t)\}$ are gaussian. 

On the other hand, if we attempt to get an optimal estimate 
under the restriction $L(\epsilon) = \epsilon^2$ and the additional requirement 
that the estimate be a linear function of the observed random 
variables, we get an estimate which is identical with the optimal 
estimate in the gaussian case, without the assumption of linearity 
or quadratic loss function. This shows that results obtainable by 
linear estimation can be bettered by nonlinear estimation only 
when (i) the random processes are nongaussian and even then (in 
view of Theorem 5, (C)) only (ii) by considering at least third- 
order probability distribution functions. 

In the special cases just mentioned, the explicit solution of the 
estimation problem is most easily understood with the help of a 
geometric picture. This is the subject of the present section. 

Consider the (real-valued) random variables $y(t_0), \ldots, y(t)$. 
The set of all linear combinations of these random variables with 
real coefficients 

$$\sum_{i=t_0}^{t} a_i y(i) \qquad (6)$$

forms a vector space (linear manifold) which we denote by $Y(t)$. 
We regard, abstractly, any expression of the form (6) as "point" 
or "vector" in $Y(t)$; this use of the word "vector" should not be 
confused, of course, with "vector-valued" random variables, etc. 
Since we do not want to fix the value of $t$ (i.e., the total number of 
possible observations), $Y(t)$ should be regarded as a finite-dimensional 
subspace of the space of all possible observations. 
Given any two vectors $u$, $v$ in $Y(t)$ (i.e., random variables 
expressible in the form (6)), we say that $u$ and $v$ are orthogonal if 
$Euv = 0$. Using the Schmidt orthogonalization procedure, as 
described for instance by Doob [15], p. 151, or by Loève [16], p. 
459, it is easy to select an orthonormal basis in $Y(t)$. By this is 
meant a set of vectors $e_{t_0}, \ldots, e_t$ in $Y(t)$ such that any vector in 
$Y(t)$ can be expressed as a unique linear combination of $e_{t_0}, \ldots, e_t$ 
and 

$$E e_i e_j = \delta_{ij} = \begin{cases} 1 & \text{if } i = j \\ 0 & \text{if } i \ne j \end{cases} \qquad (i, j = t_0, \ldots, t) \qquad (7)$$

Thus any vector $\bar{x}$ in $Y(t)$ is given by 

$$\bar{x} = \sum_{i=t_0}^{t} a_i e_i$$

and so the coefficients $a_i$ can be immediately determined with the 
aid of (7): 

$$E\bar{x}e_j = E\Big(\sum_{i=t_0}^{t} a_i e_i\Big)e_j = \sum_{i=t_0}^{t} a_i E e_i e_j = \sum_{i=t_0}^{t} a_i \delta_{ij} = a_j \qquad (8)$$


It follows further that any random variable $x$ (not necessarily 
in $Y(t)$) can be uniquely decomposed into two parts: a part $\bar{x}$ in 
$Y(t)$ and a part $\tilde{x}$ orthogonal to $Y(t)$ (i.e., orthogonal to every 
vector in $Y(t)$). In fact, we can write 

$$x = \bar{x} + \tilde{x} = \sum_{i=t_0}^{t} (E x e_i) e_i + \tilde{x} \qquad (9)$$

Thus $\bar{x}$ is uniquely determined by equation (9) and is obviously 
a vector in $Y(t)$. Therefore $\tilde{x}$ is also uniquely determined; it 
remains to check that it is orthogonal to $Y(t)$: 

$$E\tilde{x}e_i = E(x - \bar{x})e_i = E x e_i - E\bar{x}e_i$$

Now the co-ordinates of $\bar{x}$ with respect to the basis $e_{t_0}, \ldots, e_t$ are 
given either in the form $E\bar{x}e_i$ (as in (8)) or in the form $E x e_i$ (as 
in (9)). Since the co-ordinates are unique, $E\bar{x}e_i = E x e_i$ 
($i = t_0, \ldots, t$); hence $E\tilde{x}e_i = 0$ and $\tilde{x}$ is orthogonal to every base 
vector $e_i$ and therefore to $Y(t)$. We call $\bar{x}$ the orthogonal 
projection of $x$ on $Y(t)$. 

There is another way in which the orthogonal projection can be 
characterized: $\bar{x}$ is that vector in $Y(t)$ (i.e., that linear function of 
the random variables $y(t_0), \ldots, y(t)$) which minimizes the 
quadratic loss function. In fact, if $\bar{w}$ is any other vector in $Y(t)$, we 
have 

$$E(x - \bar{w})^2 = E(\tilde{x} + \bar{x} - \bar{w})^2 = E[(x - \bar{x}) + (\bar{x} - \bar{w})]^2$$

Since $\tilde{x}$ is orthogonal to every vector in $Y(t)$ and in particular to 
$\bar{x} - \bar{w}$ we have 

$$E(x - \bar{w})^2 = E(x - \bar{x})^2 + E(\bar{x} - \bar{w})^2 \ge E(x - \bar{x})^2 \qquad (10)$$

This shows that, if $\bar{w}$ also minimizes the quadratic loss, we must 
have $E(\bar{x} - \bar{w})^2 = 0$ which means that the random variables $\bar{x}$ 
and $\bar{w}$ are equal (except possibly for a set of events whose 
probability is zero). 
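
The construction in (7) to (10) can be sketched numerically by replacing each expectation with a sample average, so that every random variable becomes a column of samples. Everything below is an illustrative assumption rather than part of the paper: the four synthetic zero-mean observations, the target variable, the seed, and the helper name `inner`. The sketch builds an orthonormal basis by Schmidt orthogonalization, forms the projection as in (9), and then checks that the residual is orthogonal to every observation and that the projection agrees with an ordinary least-squares fit, as (10) suggests.

```python
import numpy as np

rng = np.random.default_rng(1)
n_samples, n_obs = 50_000, 4

# Hypothetical zero-mean, mutually correlated observations y(t0), ..., y(t),
# one column per observation time, and a target variable x.
mixing = rng.normal(size=(n_obs, n_obs))
y = rng.normal(size=(n_samples, n_obs)) @ mixing
x = y @ np.array([0.5, -1.0, 0.25, 2.0]) + rng.normal(size=n_samples)

def inner(u, v):
    """Sample analogue of the expectation E[uv]."""
    return float(np.mean(u * v))

# Schmidt orthogonalization: orthonormal basis e_i of the span of the y's.
basis = []
for i in range(n_obs):
    e = y[:, i].copy()
    for b in basis:
        e -= inner(e, b) * b          # remove the component along b
    basis.append(e / np.sqrt(inner(e, e)))

# Orthogonal projection as in (9): x_bar = sum_i (E x e_i) e_i.
x_bar = sum(inner(x, e) * e for e in basis)
x_tilde = x - x_bar

# The residual should be orthogonal to every observation.
max_corr = max(abs(inner(x_tilde, y[:, i])) for i in range(n_obs))

# The projection should coincide with the least-squares linear estimate.
coeffs, *_ = np.linalg.lstsq(y, x, rcond=None)
gap = float(np.max(np.abs(y @ coeffs - x_bar)))

print(max_corr, gap)
```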

These results may be summarized as follows: 

Theorem 2. Let $\{x(t)\}$, $\{y(t)\}$ be random processes with zero mean 
(i.e., $Ex(t) = Ey(t) = 0$ for all $t$). We observe $y(t_0), \ldots, y(t)$. 
If either 

(A) the random processes $\{x(t)\}$, $\{y(t)\}$ are gaussian; or 
(B) the optimal estimate is restricted to be a linear function of the 
observed random variables and $L(\epsilon) = \epsilon^2$; 

then 

$$\begin{aligned}
x^*(t_1|t) &= \text{optimal estimate of } x(t_1) \text{ given } y(t_0), \ldots, y(t) \\
&= \text{orthogonal projection } \bar{x}(t_1|t) \text{ of } x(t_1) \text{ on } Y(t).
\end{aligned} \qquad (11)$$


These results are well-known though not easily accessible in the 
control systems literature. See Doob [15], pp. 75-78, or Pugachev 
[26]. It is sometimes convenient to denote the orthogonal 
projection by 

$$\bar{x}(t_1|t) \equiv x^*(t_1|t) = \hat{E}[x(t_1) \mid Y(t)]$$

The notation $\hat{E}$ is motivated by part (A) of the theorem: If the 
stochastic processes in question are gaussian, then orthogonal 
projection is actually identical with conditional expectation. 

Proof. (A) Since $x(t)$, $y(t)$ are random variables with zero mean, it is 
clear from formula (9) that the orthogonal part $\tilde{x}(t_1|t)$ of $x(t_1)$ with 
respect to the linear manifold $Y(t)$ is also a random variable with 
zero mean. Orthogonal random variables with zero mean are 
uncorrelated; if they are also gaussian then (by Theorem 5 (B)) 
they are independent. Thus 

$$\begin{aligned}
0 = E\tilde{x}(t_1|t) &= E[\tilde{x}(t_1|t) \mid y(t_0), \ldots, y(t)] \\
&= E[x(t_1) - \bar{x}(t_1|t) \mid y(t_0), \ldots, y(t)] \\
&= E[x(t_1) \mid y(t_0), \ldots, y(t)] - \bar{x}(t_1|t) = 0
\end{aligned}$$

(B) This is a direct consequence of the remarks in connection 
with (10). 


Remarks. (d) A rigorous formulation of the contents of this 
section as $t \to \infty$ requires some elementary notions from the 
theory of Hilbert space. See Doob [15] and Loève [16]. 

(e) The physical interpretation of Theorem 2 is largely a matter 
of taste. If we are not worried about the assumption of gaus- 
sianness, part (A) shows that the orthogonal projection is the op- 
timal estimate for all reasonable loss functions. If we do worry 
about gaussianness, even if we are resigned to consider only 
linear estimates, we know that orthogonal projections are not 
the optimal estimate for many reasonable loss functions. Since 
in practice it is difficult to ascertain to what degree of approxima- 
tion a random process of physical origin is gaussian, it is hard to 
decide whether Theorem 2 has very broad or very limited sig- 
nificance. 

(f) Theorem 2 is immediately generalized for the case of vector-valued 
random variables. In fact, we define the linear manifold 
$Y(t)$ generated by $\mathbf{y}(t_0), \ldots, \mathbf{y}(t)$ to be the set of all linear 
combinations 

$$\sum_{i=t_0}^{t} \sum_{j=1}^{m} a_{ij} y_j(i)$$

of all m co-ordinates of each of the random vectors $\mathbf{y}(t_0), \ldots, \mathbf{y}(t)$. 
The rest of the story proceeds as before. 

(g) Theorem 2 states in effect that the optimal estimate under 
conditions (A) or (B) is a linear combination of all previous 
observations. In other words, the optimal estimate can be 
regarded as the output of a linear filter, with the input being the 
actually occurring values of the observable random variables; 
Theorem 2 gives a way of computing the impulse response of the 
optimal filter. As pointed out before, knowledge of this impulse 
response is not a complete solution of the problem; for this 
reason, no explicit formulas for the calculation of the impulse 
response will be given. 


Models for Random Processes 


In dealing with physical phenomena, it is not sufficient to give 
an empirical description but one must have also some idea of the 
underlying causes. Without being able to separate in some sense 
causes and effects, i.e., without the assumption of causality, one 
can hardly hope for useful results. 


Transactions of the ASME 
It is a fairly generally accepted fact that primary macroscopic 
sources of random phenomena are independent gaussian 
processes.⁵ A well-known example is the noise voltage produced in a 
resistor due to thermal agitation. In most cases, observed random 
phenomena are not describable by independent random variables. 
The statistical dependence (correlation) between random signals 
observed at different times is usually explained by the presence 
of a dynamic system between the primary random source and the 
observer. Thus a random function of time may be thought of as the 
output of a dynamic system excited by an independent gaussian 
random process. 
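
A minimal sketch of this idea, under assumptions chosen only for illustration: a scalar first-order system $x(t + 1) = \phi\,x(t) + u(t)$ with the hypothetical coefficient $\phi = 0.9$ is driven by independent gaussian samples $u(t)$. The source is uncorrelated from one instant to the next, while the output shows strong correlation between neighboring instants.

```python
import numpy as np

rng = np.random.default_rng(2)

phi = 0.9            # hypothetical feedback coefficient of the dynamic system
steps = 200_000

# Primary random source: independent gaussian samples.
u = rng.normal(size=steps)

# Output of the dynamic system x(t + 1) = phi * x(t) + u(t).
x = np.zeros(steps)
for t in range(steps - 1):
    x[t + 1] = phi * x[t] + u[t]

def lag_corr(signal, lag):
    """Sample correlation between signal(t) and signal(t + lag)."""
    return float(np.corrcoef(signal[:-lag], signal[lag:])[0, 1])

source_corr = lag_corr(u, 1)   # close to zero: the source is independent
output_corr = lag_corr(x, 1)   # close to phi: the system induces correlation

print(source_corr, output_corr)
```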

An important property of gaussian random signals is that they 
remain gaussian after passing through a linear system (Theorem 
5(A)). Assuming independent gaussian primary random sources, 
if the observed random signal is also gaussian, we may assume 
that the dynamic system between the observer and the primary 
source is linear. This conclusion may be forced on us also because 
of lack of detailed knowledge of the statistical properties of the 
observed random signal: Given any random process with known 
first and second-order averages, we can find a gaussian random 
process with the same properties (Theorem 5(C)). Thus gaussian 
distributions and linear dynamics are natural, mutually plausible 
assumptions particularly when the statistical data are scant. 

How is a dynamic system (linear or nonlinear) described? 
The fundamental concept is the notion of the state. By this is 
meant, intuitively, some quantitative information (a set of num- 
bers, a function, etc.) which is the least amount of data one has 
to know about the past behavior of the system in order to pre- 
dict its future behavior. The dynamics is then described in terms 
of state transitions, i.e., one must specify how one state is trans- 
formed into another as time passes. 

A linear dynamic system may be described in general by the 
vector differential equation 

$$d\mathbf{x}/dt = \mathbf{F}(t)\mathbf{x} + \mathbf{D}(t)\mathbf{u}(t)$$

and 

$$\mathbf{y}(t) = \mathbf{M}(t)\mathbf{x}(t) \qquad (12)$$

where $\mathbf{x}$ is an n-vector, the state of the system (the components $x_i$ 
of $\mathbf{x}$ are called state variables); $\mathbf{u}(t)$ is an m-vector ($m \le n$) 
representing the inputs to the system; $\mathbf{F}(t)$ and $\mathbf{D}(t)$ are n × n, 
respectively, n × m matrices. If all coefficients of $\mathbf{F}(t)$, $\mathbf{D}(t)$, 
$\mathbf{M}(t)$ are constants, we say that the dynamic system (12) is 
time-invariant or stationary. Finally, $\mathbf{y}(t)$ is a p-vector denoting the 
outputs of the system; $\mathbf{M}(t)$ is a p × n matrix; $p \le n$. 

The physical interpretation of (12) has been discussed in detail 
elsewhere [18, 20, 23]. A look at the block diagram in Fig. 1 may 
be helpful. This is not an ordinary but a matrix block diagram 
(as revealed by the fat lines indicating signal flow). The 
integrator in Fig. 1 actually stands for n integrators such that the 
output of each is a state variable; $\mathbf{F}(t)$ indicates how the 
outputs of the integrators are fed back to the inputs of the 
integrators. Thus $f_{ij}(t)$ is the coefficient with which the output of the jth 
integrator is fed back to the input of the ith integrator. It is not 
hard to relate this formalism to more conventional methods of 
linear system analysis. 


Fig. 1 Matrix block diagram of the general linear continuous-dynamic 
system 


5 The probability distributions will be gaussian because macroscopic 
random effects may be thought of as the superposition of very many 
microscopic random effects; under very general conditions, such 
aggregate effects tend to be gaussian, regardless of the statistical 
properties of the microscopic effects. The assumption of independence in 
this context is motivated by the fact that microscopic phenomena 
tend to take place much more rapidly than macroscopic 
phenomena; thus primary random sources would appear to be independent 
on a macroscopic time scale. 


Journal of Basic Engineering 


If we assume that the system (12) is stationary and that u(t) 
is constant during each sampling period, that is 


$$u(t + \tau) = u(t) = \text{const}; \quad 0 \le \tau < 1, \quad t = 0, 1, \ldots \tag{13}$$


then (12) can be readily transformed into the more convenient 
discrete form 


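$$x(t + 1) = \Phi(1)x(t) + \Delta(1)u(t); \quad t = 0, 1, \ldots$$

where [18, 20]

$$\Phi(1) = \exp F = \sum_{i=0}^{\infty} F^i/i! \quad (F^0 = \text{unit matrix})$$

and

$$\Delta(1) = \left( \int_0^1 \exp F\tau \, d\tau \right) D$$

Fig. 2 Matrix block diagram of the general linear discrete-dynamic
system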


See Fig. 2. One could also express $\exp F\tau$ in closed form using
Laplace transform methods [18, 20, 22, 24]. If u(t) satisfies (13)
but the system (12) is nonstationary, we can write analogously

$$\begin{aligned}
x(t + 1) &= \Phi(t + 1; t)x(t) + \Delta(t)u(t) \\
y(t) &= M(t)x(t)
\end{aligned} \qquad t = 0, 1, \ldots \tag{14}$$


but of course now $\Phi(t + 1; t)$, $\Delta(t)$ cannot be expressed in
general in closed form. Equations of type (14) are encountered
frequently also in the study of complicated sampled-data systems
[22]. See Fig. 2.
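
For the stationary case, the series for $\Phi(1)$ and the integral for $\Delta(1)$ given above can be evaluated term by term, since $\int_0^1 \exp F\tau \, d\tau = \sum_i F^i/(i+1)!$. The following is a minimal sketch under that assumption, not part of the original paper; the function names `mat_mul` and `discretize`, the truncation length `terms`, and the double-integrator test system are illustrative choices.

```python
def mat_mul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def discretize(F, D, terms=25):
    """Return (Phi(1), Delta(1)) for constant F (n x n) and D (n x m).

    Phi(1)   = sum_i F^i / i!            (truncated power series)
    Delta(1) = (sum_i F^i / (i+1)!) D    (series for the integral of exp(F tau))
    """
    n = len(F)
    eye = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = eye                       # holds F^i / i!, starting at i = 0
    phi = [row[:] for row in eye]    # partial sum of F^i / i!
    integral = [row[:] for row in eye]  # partial sum of F^i / (i+1)!
    for i in range(1, terms):
        term = [[v / i for v in row] for row in mat_mul(term, F)]
        for r in range(n):
            for c in range(n):
                phi[r][c] += term[r][c]
                integral[r][c] += term[r][c] / (i + 1)
    return phi, mat_mul(integral, D)


# Illustrative system (an assumption): a double integrator driven by u(t).
F = [[0.0, 1.0], [0.0, 0.0]]
D = [[0.0], [1.0]]
phi1, delta1 = discretize(F, D)
```

For this nilpotent F the series terminates after two terms, so the result can be checked by hand against $I + F$ and $(I + F/2)D$.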

$\Phi(t + 1; t)$ is the transition matrix of the system (12) or (14).
The notation $\Phi(t_2; t_1)$ ($t_2$, $t_1$ = integers) indicates
transition from time $t_1$ to time $t_2$. Evidently $\Phi(t; t) = I$ =
unit matrix. If the system (12) is stationary then
$\Phi(t + 1; t) = \Phi(t + 2; t + 1) = \ldots = \Phi(1)$ = const. Note
also the product rule: $\Phi(t; s)\Phi(s; r) = \Phi(t; r)$ and the
inverse rule $\Phi^{-1}(t; s) = \Phi(s; t)$, where t, s, r are
integers. In a stationary system, $\Phi(t; \tau) = \exp F(t - \tau)$.

As a result of the preceding discussion, we shall represent ran- 
dom phenomena by the model 


$$x(t + 1) = \Phi(t + 1; t)x(t) + u(t) \tag{15}$$

where {u(t)} is a vector-valued, independent, gaussian random
process, with zero mean, which is completely described by (in
view of Theorem 5 (C))

$$\begin{aligned}
&Eu(t) = 0 \quad \text{for all } t; \\
&Eu(t)u'(s) = 0 \quad \text{if } t \ne s; \\
&Eu(t)u'(t) = C(t).
\end{aligned}$$


Of course (Theorem 5 (A)), x(t) is then also a gaussian random 
process with zero mean, but it is no longer independent. In fact,
if we consider (15) in the steady state (assuming it is a stable
system), in other words, if we neglect the initial state $x(t_0)$, then

$$x(t) = \sum_{r=-\infty}^{t-1} \Phi(t; r + 1)u(r).$$

Therefore if $t \ge s$ we have

$$Ex(t)x'(s) = \sum_{r=-\infty}^{s-1} \Phi(t; r + 1)\,E[u(r)u'(r)]\,\Phi'(s; r + 1).$$


Thus if we assume a linear dynamic model and know the statistical 
properties of the gaussian random excitation, it is easy to find the 
corresponding statistical properties of the gaussian random process
{x(t)}.

In real life, however, the situation is usually reversed. One is 
given the covariance matrix Ex(t)x’(s) (or rather, one attempts to 
estimate the matrix from limited statistical data) and the prob- 
lem is to get (15) and the statistical properties of u(t). This is a 
subtle and presently largely unsolved problem in experimentation 
and data reduction. As in the vast majority of the engineering 
literature on the Wiener problem, we shall find it convenient to 
start with the model (15) and regard the problem of obtaining the 
model itself as a separate question. To be sure, the two problems 
should be optimized jointly if possible; the author is not aware, 
however, of any study of the joint optimization problem. 

In summary, the following assumptions are made about random 
processes: 

Physical random phenomena may be thought of as due to primary 
random sources exciting dynamic systems. The primary sources are 
assumed to be independent gaussian random processes with zero 
mean; the dynamic systems will be linear. The random processes 
are therefore described by models such as (15). The question of how 
the numbers specifying the model are obtained from experimental 
data will not be considered. 


Solution of the Wiener Problem 


Let us now define the principal problem of the paper. 
Problem 1. Consider the dynamic model 


$$x(t + 1) = \Phi(t + 1; t)x(t) + u(t) \tag{16}$$

$$y(t) = M(t)x(t) \tag{17}$$

where u(t) is an independent gaussian random process of n-vectors
with zero mean, x(t) is an n-vector, y(t) is a p-vector ($p \le n$),
$\Phi(t + 1; t)$, M(t) are $n \times n$, resp. $p \times n$, matrices
whose elements are nonrandom functions of time.

Given the observed values of $y(t_0), \ldots, y(t)$ find an estimate
$x^*(t_1|t)$ of $x(t_1)$ which minimizes the expected loss. (See Fig. 2,
where $\Delta(t) = I$.)

This problem includes as a special case the problems of filter- 
ing, prediction, and data smoothing mentioned earlier. It in- 
cludes also the problem of reconstructing all the state variables of 
a linear dynamic system from noisy observations of some of the 
state variables (p < n).

From Theorem 2-a we know that the solution of Problem I is
simply the orthogonal projection of $x(t_1)$ on the linear manifold
$\mathcal{Y}(t)$ generated by the observed random variables. As remarked
in the Introduction, this is to be accomplished by means of a 
linear (not necessarily stationary!) dynamic system of the general 
form (14). With this in mind, we proceed as follows. 

Assume that $y(t_0), \ldots, y(t - 1)$ have been measured, i.e., that
$\mathcal{Y}(t - 1)$ is known. Next, at time t, the random variable y(t)
is measured. As before let $\tilde{y}(t|t - 1)$ be the component of y(t)
orthogonal to $\mathcal{Y}(t - 1)$. If $\tilde{y}(t|t - 1) \equiv 0$,
which means that the values of all components of this random vector
are zero for almost every possible event, then $\mathcal{Y}(t)$ is
obviously the same as $\mathcal{Y}(t - 1)$ and therefore the measurement
of y(t) does not convey any additional information. This is not likely
to happen in a physically meaningful situation. In any case,
$\tilde{y}(t|t - 1)$ generates a linear manifold (possibly 0) which we
denote by $\mathcal{Z}(t)$. By definition, $\mathcal{Y}(t - 1)$ and
$\mathcal{Z}(t)$ taken together are the same manifold as $\mathcal{Y}(t)$,
and every vector in $\mathcal{Z}(t)$ is orthogonal to every vector in
$\mathcal{Y}(t - 1)$.

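Assuming by induction that $x^*(t_1 - 1|t - 1)$ is known, we can
write:

$$\begin{aligned}
x^*(t_1|t) &= E[x(t_1)|\mathcal{Y}(t)] = E[x(t_1)|\mathcal{Y}(t - 1)] + E[x(t_1)|\mathcal{Z}(t)] \\
&= \Phi(t + 1; t)x^*(t_1 - 1|t - 1) + E[u(t_1 - 1)|\mathcal{Y}(t - 1)] \\
&\quad + E[x(t_1)|\mathcal{Z}(t)]
\end{aligned} \tag{18}$$

where the last line is obtained using (16).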

Let $t_1 = t + s$, where s is any integer. If $s \ge 0$, then
$u(t_1 - 1)$ is independent of $\mathcal{Y}(t - 1)$. This is because
$u(t_1 - 1) = u(t + s - 1)$ is then independent of u(t - 2),
u(t - 3), ... and therefore by (16-17), independent of
$y(t_0), \ldots, y(t - 1)$, hence independent of $\mathcal{Y}(t - 1)$.
Since, for all t, u(t) has zero mean by assumption, it follows that
$u(t_1 - 1)$ ($s \ge 0$) is orthogonal to $\mathcal{Y}(t - 1)$. Thus if
$s \ge 0$, the second term on the right-hand side of (18) vanishes; if
s < 0, considerable complications result in evaluating this term.
We shall consider only the case $t_1 \ge t$. Furthermore, it will
suffice to consider in detail only the case $t_1 = t + 1$ since the
other cases can be easily reduced to this one.

The last term in (18) must be a linear operation on the random
variable $\tilde{y}(t|t - 1)$:

$$E[x(t + 1)|\mathcal{Z}(t)] = \Delta^*(t)\tilde{y}(t|t - 1) \tag{19}$$

where $\Delta^*(t)$ is an $n \times p$ matrix, and the star refers to
"optimal filtering."

The component of y(t) lying in $\mathcal{Y}(t - 1)$ is
$\bar{y}(t|t - 1) = M(t)x^*(t|t - 1)$. Hence

$$\tilde{y}(t|t - 1) = y(t) - \bar{y}(t|t - 1) = y(t) - M(t)x^*(t|t - 1) \tag{20}$$

Combining (18-20) (see Fig. 3) we obtain

$$x^*(t + 1|t) = \Phi^*(t + 1; t)x^*(t|t - 1) + \Delta^*(t)y(t) \tag{21}$$

where

$$\Phi^*(t + 1; t) = \Phi(t + 1; t) - \Delta^*(t)M(t) \tag{22}$$


Thus optimal estimation is performed by a linear dynamic 
system of the same form as (14). The state of the estimator is 
the previous estimate, the input is the last measured value of the 
observable random variable y(t), the transition matrix is given by 
(22). Notice that physical realization of the optimal filter requires
only (i) the model of the random process, (ii) the operator $\Delta^*(t)$.

The estimation error is also governed by a linear dynamic 
system. In fact, 


$$\begin{aligned}
\tilde{x}(t + 1|t) &= x(t + 1) - x^*(t + 1|t) \\
&= \Phi(t + 1; t)x(t) + u(t) - \Phi^*(t + 1; t)x^*(t|t - 1) - \Delta^*(t)M(t)x(t) \\
&= \Phi^*(t + 1; t)\tilde{x}(t|t - 1) + u(t)
\end{aligned} \tag{23}$$

Fig. 3 Matrix block diagram of optimal filter

Thus $\Phi^*$ is also the transition matrix of the linear dynamic system
governing the error.

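From (23) we obtain at once a recursion relation for the
covariance matrix $P^*(t)$ of the optimal error $\tilde{x}(t|t - 1)$.
Noting that u(t) is independent of x(t) and therefore of
$\tilde{x}(t|t - 1)$, we get

$$\begin{aligned}
P^*(t + 1) &= E\tilde{x}(t + 1|t)\tilde{x}'(t + 1|t) \\
&= \Phi^*(t + 1; t)\,E\tilde{x}(t|t - 1)\tilde{x}'(t|t - 1)\,\Phi^{*\prime}(t + 1; t) + Q(t) \\
&= \Phi^*(t + 1; t)\,E\tilde{x}(t|t - 1)\tilde{x}'(t|t - 1)\,\Phi'(t + 1; t) + Q(t) \\
&= \Phi^*(t + 1; t)P^*(t)\Phi'(t + 1; t) + Q(t)
\end{aligned} \tag{24}$$

where $Q(t) = Eu(t)u'(t)$.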
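There remains the problem of obtaining an explicit formula for
$\Delta^*$ (and thus also for $\Phi^*$). Since

$$\tilde{x}(t + 1|\mathcal{Z}(t)) = x(t + 1) - E[x(t + 1)|\mathcal{Z}(t)]$$

is orthogonal to $\tilde{y}(t|t - 1)$, it follows by (19) that

$$\begin{aligned}
0 &= E[x(t + 1) - \Delta^*(t)\tilde{y}(t|t - 1)]\tilde{y}'(t|t - 1) \\
&= Ex(t + 1)\tilde{y}'(t|t - 1) - \Delta^*(t)E\tilde{y}(t|t - 1)\tilde{y}'(t|t - 1).
\end{aligned}$$

Noting that $\bar{x}(t + 1|t - 1)$ is orthogonal to $\mathcal{Z}(t)$, the
definition of $P^*(t)$ given earlier, and (17), it follows further

$$\begin{aligned}
0 &= E\tilde{x}(t + 1|t - 1)\tilde{y}'(t|t - 1) - \Delta^*(t)M(t)P^*(t)M'(t) \\
&= E[\Phi(t + 1; t)\tilde{x}(t|t - 1) + \tilde{u}(t|t - 1)]\tilde{x}'(t|t - 1)M'(t) \\
&\quad - \Delta^*(t)M(t)P^*(t)M'(t).
\end{aligned}$$

Finally, since u(t) is independent of x(t),

$$0 = \Phi(t + 1; t)P^*(t)M'(t) - \Delta^*(t)M(t)P^*(t)M'(t).$$

Now the matrix $M(t)P^*(t)M'(t)$ will be positive definite and hence
invertible whenever $P^*(t)$ is positive definite, provided that none
of the rows of M(t) are linearly dependent at any time, in other
words, that none of the observed scalar random variables
$y_1(t), \ldots, y_m(t)$ is a linear combination of the others. Under
these circumstances we get finally:

$$\Delta^*(t) = \Phi(t + 1; t)P^*(t)M'(t)[M(t)P^*(t)M'(t)]^{-1} \tag{25}$$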


Since observations start at $t_0$, $\tilde{x}(t_0|t_0 - 1) = x(t_0)$; to
begin the iterative evaluation of $P^*(t)$ by means of equation (24),
we must obviously specify $P^*(t_0) = Ex(t_0)x'(t_0)$. Assuming this
matrix is positive definite, equation (25) then yields $\Delta^*(t_0)$;
equation (22) $\Phi^*(t_0 + 1; t_0)$, and equation (24) $P^*(t_0 + 1)$,
completing the cycle. If now Q(t) is positive definite, then all the
$P^*(t)$ will be positive definite and the requirements in deriving
(25) will be satisfied at each step.

Now we remove the restriction that $t_1 = t + 1$. Since u(t) is
orthogonal to $\mathcal{Y}(t)$, we have

$$x^*(t + 1|t) = E[\Phi(t + 1; t)x(t) + u(t)|\mathcal{Y}(t)] = \Phi(t + 1; t)x^*(t|t)$$

Hence if $\Phi(t + 1; t)$ has an inverse $\Phi(t; t + 1)$ (which is
always the case when $\Phi$ is the transition matrix of a dynamic
system describable by a differential equation) we have

$$x^*(t|t) = \Phi(t; t + 1)x^*(t + 1|t)$$

If $t_1 \ge t + 1$, we first observe by repeated application of (16) that

$$x(t + s) = \Phi(t + s; t + 1)x(t + 1) + \sum_{r=1}^{s-1} \Phi(t + s; t + r + 1)u(t + r) \quad (s \ge 1)$$

Since $u(t + s - 1), \ldots, u(t + 1)$ are all orthogonal to
$\mathcal{Y}(t)$,

$$\begin{aligned}
x^*(t + s|t) &= E[x(t + s)|\mathcal{Y}(t)] \\
&= E[\Phi(t + s; t + 1)x(t + 1)|\mathcal{Y}(t)] \\
&= \Phi(t + s; t + 1)x^*(t + 1|t) \quad (s \ge 1)
\end{aligned}$$

If s < 0, the results are similar, but $x^*(t - s|t)$ will have
(1 - s)(n - p) co-ordinates.
The results of this section may be summarized as follows: 
Theorem 3. (Solution of the Wiener Problem) 
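Consider Problem I. The optimal estimate $x^*(t + 1|t)$ of x(t + 1)
given $y(t_0), \ldots, y(t)$ is generated by the linear dynamic system

$$x^*(t + 1|t) = \Phi^*(t + 1; t)x^*(t|t - 1) + \Delta^*(t)y(t) \tag{21}$$

The estimation error is given by

$$\tilde{x}(t + 1|t) = \Phi^*(t + 1; t)\tilde{x}(t|t - 1) + u(t) \tag{23}$$

The covariance matrix of the estimation error is

$$\operatorname{cov} \tilde{x}(t|t - 1) = E\tilde{x}(t|t - 1)\tilde{x}'(t|t - 1) = P^*(t) \tag{26}$$

The expected quadratic loss is

$$\sum_{i=1}^{n} E\tilde{x}_i^2(t|t - 1) = \operatorname{trace} P^*(t) \tag{27}$$

The matrices $\Delta^*(t)$, $\Phi^*(t + 1; t)$, $P^*(t)$ are generated
by the recursion relations

$$\Delta^*(t) = \Phi(t + 1; t)P^*(t)M'(t)[M(t)P^*(t)M'(t)]^{-1} \tag{28}$$

$$\Phi^*(t + 1; t) = \Phi(t + 1; t) - \Delta^*(t)M(t) \tag{29}$$

$$P^*(t + 1) = \Phi^*(t + 1; t)P^*(t)\Phi'(t + 1; t) + Q(t) \tag{30}$$

In order to carry out the iterations, one must specify the covariance
$P^*(t_0)$ of $x(t_0)$ and the covariance Q(t) of u(t). Finally, for any
$s \ge 0$, if $\Phi$ is invertible

$$\begin{aligned}
x^*(t + s|t) &= \Phi(t + s; t + 1)x^*(t + 1|t) \\
&= \Phi(t + s; t + 1)\Phi^*(t + 1; t)\Phi(t; t + s - 1) \\
&\quad \times x^*(t + s - 1|t - 1) \\
&\quad + \Phi(t + s; t + 1)\Delta^*(t)y(t)
\end{aligned} \tag{31}$$

so that the estimate $x^*(t + s|t)$ ($s \ge 0$) is also given by a linear
dynamic system of the type (21).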
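
The cycle described by Theorem 3 can be sketched in a few lines for the special case of a single observed variable (p = 1), where $M P^* M'$ is a scalar and the inverse in (28) is an ordinary division. This is an illustrative sketch rather than the paper's own implementation; the function name `filter_cycle`, the position/velocity model, the starting covariance, and the observation values are all assumptions made for the example.

```python
def filter_cycle(Phi, M, Q, P, x_pred, y):
    """One cycle of (28)-(30) and (21) for a single observed variable.

    Phi, Q, P are n x n lists of rows; M is a length-n row (p = 1);
    x_pred is x*(t | t-1) and y is the scalar observation y(t).
    Returns Delta*(t), Phi*(t+1; t), P*(t+1), x*(t+1 | t).
    """
    n = len(Phi)
    PMt = [sum(P[i][k] * M[k] for k in range(n)) for i in range(n)]  # P* M'
    s = sum(M[i] * PMt[i] for i in range(n))                         # M P* M'
    # (28): Delta* = Phi P* M' [M P* M']^{-1}
    Delta = [sum(Phi[i][k] * PMt[k] for k in range(n)) / s for i in range(n)]
    # (29): Phi* = Phi - Delta* M
    Phi_star = [[Phi[i][j] - Delta[i] * M[j] for j in range(n)] for i in range(n)]
    # (30): P*(t+1) = Phi* P* Phi' + Q
    PPhit = [[sum(P[i][k] * Phi[j][k] for k in range(n)) for j in range(n)]
             for i in range(n)]
    P_next = [[sum(Phi_star[i][k] * PPhit[k][j] for k in range(n)) + Q[i][j]
               for j in range(n)] for i in range(n)]
    # (21): x*(t+1 | t) = Phi* x*(t | t-1) + Delta* y(t)
    x_next = [sum(Phi_star[i][k] * x_pred[k] for k in range(n)) + Delta[i] * y
              for i in range(n)]
    return Delta, Phi_star, P_next, x_next


# Hypothetical model: state = (position, velocity), position observed.
Phi = [[1.0, 1.0], [0.0, 1.0]]
M = [1.0, 0.0]
Q = [[0.1, 0.0], [0.0, 0.1]]
P = [[1.0, 0.0], [0.0, 1.0]]     # assumed P*(t0), positive definite
x_pred = [0.0, 0.0]              # assumed x*(t0 | t0 - 1)
for y in [2.0, 3.1, 3.9]:        # made-up observations
    Delta, Phi_star, P, x_pred = filter_cycle(Phi, M, Q, P, x_pred, y)
```

Only $P^*(t_0)$ and Q(t) need to be supplied beyond the model itself, which mirrors the closing statement of the theorem.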

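Remarks. (h) Eliminating $\Delta^*$ and $\Phi^*$ from (28-30), a
nonlinear difference equation is obtained for $P^*(t)$:

$$\begin{aligned}
P^*(t + 1) = \Phi(t + 1; t)\{&P^*(t) - P^*(t)M'(t)[M(t)P^*(t)M'(t)]^{-1}M(t)P^*(t)\} \\
&\times \Phi'(t + 1; t) + Q(t) \quad (t \ge t_0)
\end{aligned} \tag{32}$$

This equation is linear only if M(t) is invertible but then the
problem is trivial since all components of the random vector x(t)
are observable and $P^*(t + 1) = Q(t)$. Observe that equation (32)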
plays a role in the present theory analogous to that of the Wiener- 
Hopf equation in the conventional theory. 

Once P*(t) has been computed via (32) starting at $t = t_0$, the
explicit specification of the optimal linear filter is immediately 
available from formulas (29-30). Of course, the solution of 
Equation (32), or of its differential-equation equivalent, is a 
much simpler task than solution of the Wiener-Hopf equation. 

(i) The results stated in Theorem 3 do not resolve completely
Problem I. Little has been said, for instance, about the physical
significance of the assumptions needed to obtain equation (25),
the convergence and stability of the nonlinear difference
equation (32), the stability of the optimal filter (21), etc. This can
actually be done in a completely satisfactory way, but must be
left to a future paper. In this connection, the principal guide and
tool turns out to be the duality theorem mentioned briefly in the
next section. See [29].

(j) By letting the sampling period (equal to one so far) ap- 
proach zero, the method can be used to obtain the specification of 
a differential equation for the optimal filter. To do this, i.e., to 
pass from equation (14) to equation (12), requires computing 
the logarithm F* of the matrix $\Phi^*$. But this can be done only if
$\Phi^*$ is nonsingular, which is easily seen not to be the case. This
is because it is sufficient for the optimal filter to have n — p state 
variables, rather than n, as the formalism of equation (22) would 
seem to imply. By appropriate modifications, therefore, equa- 
tion (22) can be reduced to an equivalent set of only n — p equa- 
tions whose transition matrix is nonsingular. Details of this type 
will be covered in later publications. 

(k) The dynamic system (21) is, in general, nonstationary. 
This is due to two things: (1) The time dependence of $\Phi(t + 1; t)$
and M(t); (2) the fact that the estimation starts at $t = t_0$ and
improves as more data are accumulated. If $\Phi$, M are constants,
it can be shown that (21) becomes a stationary dynamic system in
the limit $t \to \infty$. This is the case treated by the classical Wiener
theory. 

(l) It is noteworthy that the derivations given are not affected
by the nonstationarity of the model for x(t) or the finiteness of 
available data. In fact, as far as the author is aware, the only 
explicit recursion relations given before for the growing-memory 
filter are due to Blum [12]. However, his results are much more 
complicated than ours. 

(m) By inspection of Fig. 3 we see that the optimal filter is a 
feedback system, and that the signal after the first summer is 
white noise since $\tilde{y}(t|t - 1)$ is obviously an orthogonal random
process. This corresponds to some well-known results in Wiener 
filtering, see, e.g., Smith [28], Chapter 6, Fig. 6-4. However, 
this is apparently the first rigorous proof that every Wiener filter 
is realizable by means of a feedback system. Moreover, it will be 
shown in another paper that such a filter is always stable, under 
very mild assumptions on the model (16-17). See [29]. 


The Dual Problem 


Let us now consider another problem which is conceptually 
very different from optimal estimation, namely, the noise-free 
regulator problem. In the simplest cases, this is: 

Problem II. Consider the dynamic system

$$x(t + 1) = \hat{\Phi}(t + 1; t)x(t) + \hat{M}(t)u(t) \tag{33}$$

where x(t) is an n-vector, u(t) is an m-vector ($m \le n$),
$\hat{\Phi}$, $\hat{M}$ are $n \times n$ resp. $n \times m$ matrices whose
elements are nonrandom functions of time. Given any state x(t) at
time t, we are to find a sequence u(t), ..., u(T) of control vectors
which minimizes the performance index

$$V[x(t)] = \sum_{\tau=t}^{T+1} x'(\tau)\hat{Q}(\tau)x(\tau)$$

where $\hat{Q}(t)$ is a positive definite matrix whose elements are
nonrandom functions of time. See Fig. 2, where $\Delta = \hat{M}$ and
M = I.

Probabilistic considerations play no part in Problem II; it is
implicitly assumed that every state variable can be measured
exactly at each instant t, t + 1, ..., T. It is customary to call
$T \ge t$ the terminal time (it may be infinity).

The first general solution of the noise-free regulator problem is 
due to the author [18]. The main result is that the optimal con- 
trol vectors u*(t) are nonstationary linear functions of x(t). After 
a change in notation, the formulas of the Appendix, Reference 
[18] (see also Reference [23]) are as follows: 


$$u^*(t) = -\hat{\Delta}^*(t)x(t) \tag{34}$$

Under optimal control as given by (34), the "closed-loop" equations
for the system are (see Fig. 4)

$$x(t + 1) = \hat{\Phi}^*(t + 1; t)x(t)$$

and the minimum performance index at time t is given by

$$V^*[x(t)] = x'(t)\hat{P}^*(t - 1)x(t)$$

The matrices $\hat{\Delta}^*(t)$, $\hat{\Phi}^*(t + 1; t)$,
$\hat{P}^*(t)$ are determined by the recursion relations:

$$\hat{\Delta}^*(t) = [\hat{M}'(t)\hat{P}^*(t)\hat{M}(t)]^{-1}\hat{M}'(t)\hat{P}^*(t)\hat{\Phi}(t + 1; t) \tag{35}$$

$$\hat{\Phi}^*(t + 1; t) = \hat{\Phi}(t + 1; t) - \hat{M}(t)\hat{\Delta}^*(t) \tag{36}$$

$$\hat{P}^*(t - 1) = \hat{\Phi}'(t + 1; t)\hat{P}^*(t)\hat{\Phi}^*(t + 1; t) + \hat{Q}(t) \tag{37}$$

Initially we must set $\hat{P}^*(T) = \hat{Q}(T + 1)$.

Fig. 4 Matrix block diagram of optimal controller


Comparing equations (35-37) with (28-30) and Fig. 3 with Fig. 4
we notice some interesting things which are expressed precisely
by

Theorem 4. (Duality Theorem) Problem I and Problem II are
duals of each other in the following sense:

Let $\tau \ge 0$. Replace every matrix $X(t) = X(t_0 + \tau)$ in (28-30)
by $\hat{X}'(t) = \hat{X}'(T - \tau)$. Then one has (35-37). Conversely,
replace every matrix $\hat{X}(T - \tau)$ in (35-37) by $X'(t_0 + \tau)$.
Then one has (28-30).

Proof. Carry out the substitutions. For ease of reference, the
dualities between the two problems are given in detail in Table 1.


Table 1

Problem I | Problem II
1. x(t): (unobservable) state variables of random process. | x(t): (observable) state variables of plant to be regulated.
2. y(t): observed random variables. | u(t): control variables.
3. $t_0$: first observation. | T: last control action.
4. $\Phi(t_0 + \tau + 1; t_0 + \tau)$: transition matrix. | $\hat{\Phi}(T - \tau + 1; T - \tau)$: transition matrix.
5. $P^*(t_0 + \tau)$: covariance of optimized estimation error. | $\hat{P}^*(T - \tau)$: matrix of quadratic form for performance index under optimal regulation.
6. $\Delta^*(t_0 + \tau)$: weighting of observation for optimal estimation. | $\hat{\Delta}^*(T - \tau)$: weighting of state for optimal control.
7. $\Phi^*(t_0 + \tau + 1; t_0 + \tau)$: transition matrix for optimal estimation error. | $\hat{\Phi}^*(T - \tau + 1; T - \tau)$: transition matrix under optimal regulation.
8. $M(t_0 + \tau)$: effect of state on observation. | $\hat{M}(T - \tau)$: effect of control vectors on state.
9. $Q(t_0 + \tau)$: covariance of random excitation. | $\hat{Q}(T - \tau)$: matrix of quadratic form defining error criterion.
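
The substitutions named in the Duality Theorem can also be checked numerically. The sketch below assumes a single observed variable in Problem I and a single control variable in Problem II, so that both matrix inverses reduce to divisions. It runs the filter recursion (28-30) forward and the regulator recursion (35-37) backward on transposed data, and records the largest mismatch. The helper names, the model matrices, and the number of steps are illustrative assumptions, not taken from the paper.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(col) for col in zip(*a)]


def add(a, b, sign=1.0):
    return [[x + sign * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def filter_cycle(Phi, M, Q, P):
    """(28)-(30) with M of size 1 x n, so M P M' is a scalar."""
    s = matmul(matmul(M, P), transpose(M))[0][0]
    Delta = [[v / s for v in row]
             for row in matmul(matmul(Phi, P), transpose(M))]        # (28)
    Phi_star = add(Phi, matmul(Delta, M), -1.0)                      # (29)
    P_next = add(matmul(matmul(Phi_star, P), transpose(Phi)), Q)     # (30)
    return Delta, Phi_star, P_next


def regulator_cycle(Phi_h, M_h, Q_h, P_h):
    """(35)-(37) with M_h of size n x 1, so M_h' P_h M_h is a scalar."""
    s = matmul(matmul(transpose(M_h), P_h), M_h)[0][0]
    Delta_h = [[v / s for v in row]
               for row in matmul(matmul(transpose(M_h), P_h), Phi_h)]   # (35)
    Phi_h_star = add(Phi_h, matmul(M_h, Delta_h), -1.0)                 # (36)
    P_prev = add(matmul(matmul(transpose(Phi_h), P_h), Phi_h_star), Q_h)  # (37)
    return Delta_h, Phi_h_star, P_prev


def max_diff(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


# Illustrative constant model (an assumption).
Phi = [[1.0, 1.0], [0.0, 0.9]]
M = [[1.0, 0.5]]
Q = [[0.2, 0.0], [0.0, 0.1]]
P = [[1.0, 0.0], [0.0, 1.0]]      # P*(t0)
P_h = transpose(P)                # dual starting matrix at time T
worst = 0.0
for _ in range(5):                # tau = 0, 1, ..., 4
    Delta, Phi_star, P = filter_cycle(Phi, M, Q, P)
    Delta_h, Phi_h_star, P_h = regulator_cycle(
        transpose(Phi), transpose(M), transpose(Q), P_h)
    worst = max(worst,
                max_diff(Delta_h, transpose(Delta)),
                max_diff(Phi_h_star, transpose(Phi_star)),
                max_diff(P_h, transpose(P)))
```

After the loop, `worst` holds the largest entrywise difference between each regulator matrix and the transpose of its filter counterpart; under the duality it should be at the level of rounding error.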


Remarks. (n) The mathematical significance of the duality between
Problem I and Problem II is that both problems reduce to
the solution of the Wiener-Hopf-like equation (32).

(o) The physical significance of the duality is intriguing. Why
are observations and control dual quantities?

Recent research [29] has shown that the essence of the Duality
Theorem lies in the duality of constraints at the output
(represented by the matrix M(t) in Problem I) and constraints at the
input (represented by the matrix $\hat{M}(t)$ in Problem II).

(p) Applications of Wiener’s methods to the solution of noise- 
free regulator problem have been known for a long time; see the 
recent textbook of Newton, Gould, and Kaiser [27]. However, 
the connections between the two problems, and in particular the 
duality, have apparently never been stated precisely before. 

(q) The duality theorem offers a powerful tool for developing 
more deeply the theory (as opposed to the computation) of Wiener 
filters, as mentioned in Remark (i). This will be published 
elsewhere [29]. 


Applications 


The power of the new approach to the Wiener problem, as ex- 
pressed by Theorem 3, is most obvious when the data of the 
problem are given in numerical form. In that case, one simply 
performs the numerical computations required by (28-30). Re- 
sults of such calculations, in some cases of practical engineering 
interest, will be published elsewhere. 

When the answers are desired in closed analytic form, the
iterations (28-30) may lead to very unwieldy expressions. In a few
cases, $\Delta^*$ and $\Phi^*$ can be put into "closed form." Without
discussing here how (if at all) such closed forms can be obtained, we
now give two examples indicative of the type of results to be
expected.

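Example 1. Consider the problem mentioned under "Optimal
Estimates." Let $x_1(t)$ be the signal and $x_2(t)$ the noise. We
assume the model:

$$\begin{aligned}
x_1(t + 1) &= \phi_{11}(t + 1; t)x_1(t) + u_1(t) \\
x_2(t + 1) &= u_2(t) \\
y_1(t) &= x_1(t) + x_2(t)
\end{aligned}$$

The specific data for which we desire a solution of the estimation
problem are as follows:

1 $t_1 = t + 1$; $t_0 = 0$
2 $Ex_1^2(0) = 0$, i.e., $x_1(0) = 0$
3 $Eu_1^2(t) = a^2$, $Eu_2^2(t) = b^2$, $Eu_1(t)u_2(t) = 0$ (for all t)
4 $\phi_{11}(t + 1; t) = \phi_{11}$ = const.

A simple calculation shows that the following matrices satisfy
the difference equations (28-30), for all $t \ge t_0$:

$$\Delta^*(t) = \begin{bmatrix} \phi_{11}C(t) \\ 0 \end{bmatrix}$$

$$\Phi^*(t + 1; t) = \begin{bmatrix} \phi_{11}[1 - C(t)] & -\phi_{11}C(t) \\ 0 & 0 \end{bmatrix}$$

$$P^*(t + 1) = \begin{bmatrix} a^2 + \phi_{11}^2 b^2 C(t) & 0 \\ 0 & b^2 \end{bmatrix}$$

where

$$C(t + 1) = 1 - \frac{b^2}{a^2 + b^2 + \phi_{11}^2 b^2 C(t)}, \quad t \ge 0 \tag{38}$$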


Since it was assumed that $x_1(0) = 0$, neither $x_1(1)$ nor $x_2(1)$ can
be predicted from the measurement of $y_1(0)$. Hence the
measurement at time t = 0 is useless, which shows that we should set
C(0) = 0. This fact, with the iterations (38), completely
determines the function C(t). The nonlinear difference equation (38)
plays the role of the Wiener-Hopf equation.

If $b^2/a^2 \ll 1$, then $C(t) \approx 1$ which is essentially pure
prediction. If $b^2/a^2 \gg 1$, then $C(t) \approx 0$, and we depend
mainly on $x_1^*(t|t - 1)$ for the estimation of $x_1^*(t + 1|t)$ and
assign only very small weight to the measurement $y_1(t)$; this is
what one would expect when the measured data are very noisy.

In any case, $x_2^*(t|t - 1) = 0$ at all times; one cannot predict
independent noise! This means that $\phi^*_{12}$ can be set equal to
zero. The optimal predictor is a first-order dynamic system.
See Remark (j).

To find the stationary Wiener filter, let $t = \infty$ on both sides of
(38), solve the resulting quadratic equation in $C(\infty)$, etc.
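
The behavior of C(t) described above can be explored by iterating (38) directly from C(0) = 0. The sketch below assumes the form $C(t + 1) = 1 - b^2/(a^2 + b^2 + \phi_{11}^2 b^2 C(t))$ for (38). Setting $C(t + 1) = C(t) = C(\infty)$ then gives the quadratic $k C^2 + (a^2 + b^2 - k) C - a^2 = 0$ with $k = \phi_{11}^2 b^2$, whose positive root is returned by `c_stationary`. The function names and numerical values are illustrative and do not come from the paper.

```python
import math


def c_sequence(a, b, phi11, steps):
    """Iterate (38) from C(0) = 0 and return [C(0), C(1), ..., C(steps)]."""
    c = 0.0
    out = [c]
    for _ in range(steps):
        c = 1.0 - b * b / (a * a + b * b + phi11 * phi11 * b * b * c)
        out.append(c)
    return out


def c_stationary(a, b, phi11):
    """Positive root of k C^2 + (a^2 + b^2 - k) C - a^2 = 0, k = phi11^2 b^2.

    Requires phi11 != 0 and b != 0 so that k > 0.
    """
    k = phi11 * phi11 * b * b
    s = a * a + b * b - k
    return (-s + math.sqrt(s * s + 4.0 * k * a * a)) / (2.0 * k)


# Illustrative values (assumptions): equal signal and noise power.
history = c_sequence(1.0, 1.0, 0.9, 50)
limit = c_stationary(1.0, 1.0, 0.9)
# Low-noise and high-noise cases discussed in the text.
low_noise = c_sequence(1.0, 0.01, 0.9, 50)[-1]
high_noise = c_sequence(1.0, 100.0, 0.9, 50)[-1]
```

In this sketch `low_noise` comes out close to 1 and `high_noise` close to 0, in line with the two limiting cases in the text.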

Example 2. A number of particles leave the origin at time $t_0 = 0$
with random velocities; after t = 0, each particle moves with a
constant (unknown) velocity. Suppose that the position of one
of these particles is measured, the data being contaminated by
stationary, additive, correlated noise. What is the optimal
estimate of the position and velocity of the particle at the time
of the last measurement?

Let $x_1(t)$ be the position and $x_2(t)$ the velocity of the particle;
$x_3(t)$ is the noise. The problem is then represented by the model,

$$\begin{aligned}
x_1(t + 1) &= x_1(t) + x_2(t) \\
x_2(t + 1) &= x_2(t) \\
x_3(t + 1) &= \phi_{33}(t + 1; t)x_3(t) + u_3(t) \\
y_1(t) &= x_1(t) + x_3(t)
\end{aligned}$$

and the additional conditions

1 $t_1 = t$; $t_0 = 0$
2 $Ex_1^2(0) = Ex_2(0) = 0$, $Ex_2^2(0) = a^2 > 0$;
3 $Eu_3(t) = 0$, $Eu_3^2(t) = b^2$.
4 $\phi_{33}(t + 1; t) = \phi_{33}$ = const.


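According to Theorem 3, $x^*(t|t)$ is calculated using the dynamic
system (31).

First we solve the problem of predicting the position and
velocity of the particle one step ahead. Simple considerations
show that

$$P^*(1) = \begin{bmatrix} a^2 & a^2 & 0 \\ a^2 & a^2 & 0 \\ 0 & 0 & b^2 \end{bmatrix} \quad \text{and} \quad \Delta^*(0) = \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix}$$

It is then easy to check by substitution into equations (28-30)
that

$$P^*(t) = \frac{b^2}{C_1(t - 1)} \begin{bmatrix} t^2 & t & -\phi_{33}t(t - 1) \\ t & 1 & -\phi_{33}(t - 1) \\ -\phi_{33}t(t - 1) & -\phi_{33}(t - 1) & \phi_{33}^2(t - 1)^2 + C_1(t - 1) \end{bmatrix}$$

is the correct expression for the covariance matrix of the
prediction error $\tilde{x}(t|t - 1)$ for all $t \ge 1$, provided that we
define

$$\begin{aligned}
C_1(0) &= b^2/a^2 \\
C_1(t) &= C_1(t - 1) + [t - \phi_{33}(t - 1)]^2, \quad t \ge 1
\end{aligned}$$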


It is interesting to note that the results just obtained are valid 
also when $\phi_{33}$ depends on t. This is true also in Example 1. In
conventional treatments of such problems there seems to be an 
essential difference between the cases of stationary and nonsta- 
tionary noise. This misleading impression created by the con- 
ventional theory is due to the very special methods used in solving 
the Wiener-Hopf equation. 

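Introducing the abbreviation

$$\begin{aligned}
C_2(0) &= 0 \\
C_2(t) &= t - \phi_{33}(t - 1), \quad t \ge 1
\end{aligned}$$

and observing that

$$\begin{aligned}
\operatorname{cov} \tilde{x}(t + 1|t) &= P^*(t + 1) \\
&= \Phi(t + 1; t)[\operatorname{cov} \tilde{x}(t|t)]\Phi'(t + 1; t) + Q(t)
\end{aligned}$$

the matrices occurring in equation (31) and the covariance
matrix of $\tilde{x}(t|t)$ are found after simple calculations. We have,
for all $t \ge 0$,

$$\Phi(t; t + 1)\Delta^*(t) = \frac{1}{C_1(t)} \begin{bmatrix} tC_2(t) \\ C_2(t) \\ C_1(t) - tC_2(t) \end{bmatrix}$$

$$\Phi(t; t + 1)\Phi^*(t + 1; t)\Phi(t + 1; t) = \frac{1}{C_1(t)} \begin{bmatrix} C_1(t) - tC_2(t) & C_1(t) - tC_2(t) & -\phi_{33}tC_2(t) \\ -C_2(t) & C_1(t) - C_2(t) & -\phi_{33}C_2(t) \\ -C_1(t) + tC_2(t) & -C_1(t) + tC_2(t) & \phi_{33}tC_2(t) \end{bmatrix}$$

and

$$\operatorname{cov} \tilde{x}(t|t) = E\tilde{x}(t|t)\tilde{x}'(t|t) = \frac{b^2}{C_1(t)} \begin{bmatrix} t^2 & t & -t^2 \\ t & 1 & -t \\ -t^2 & -t & t^2 \end{bmatrix}$$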


To gain some insight into the behavior of this system, let us
examine the limiting case $t \to \infty$ of a large number of
observations. Then $C_1(t)$ obeys approximately the differential equation

$$dC_1(t)/dt \approx C_2^2(t) \quad (t \gg 1)$$

from which we find

$$C_1(t) \approx (1 - \phi_{33})^2 t^3/3 + \phi_{33}(1 - \phi_{33})t^2 + \phi_{33}^2 t + b^2/a^2 \quad (t \gg 1) \tag{39}$$
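Using (39), we get further

$$\Phi^{-1}\Phi^*\Phi \approx \begin{bmatrix} 1 & 1 & 0 \\ 0 & 1 & 0 \\ -1 & -1 & 0 \end{bmatrix} \quad \text{and} \quad \Phi^{-1}\Delta^* \approx \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} \quad (t \gg 1)$$

Thus as the number of observations becomes large, we depend
almost exclusively on $x_1^*(t|t)$ and $x_2^*(t|t)$ to estimate
$x_1^*(t + 1|t + 1)$ and $x_2^*(t + 1|t + 1)$. Current observations are
used almost exclusively to estimate the noise

$$x_3^*(t|t) \approx y_1(t) - x_1^*(t|t) \quad (t \gg 1)$$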


One would of course expect something like this since the prob- 
lem is analogous to fitting a straight line to an increasing number 
of points. 
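
The quality of the large-t approximation (39) can be checked against the exact recursion for $C_1(t)$. The sketch below assumes the recursion $C_1(0) = b^2/a^2$, $C_1(t) = C_1(t - 1) + [t - \phi_{33}(t - 1)]^2$ and the cubic form of (39) as read from the text. The function names, the parameter `ratio` (standing for $b^2/a^2$), and the numerical values are illustrative assumptions.

```python
def c1_exact(t, phi33, ratio):
    """C1(t) from C1(0) = ratio and C1(k) = C1(k-1) + [k - phi33 (k-1)]^2."""
    c1 = ratio
    for k in range(1, t + 1):
        c1 += (k - phi33 * (k - 1)) ** 2
    return c1


def c1_approx(t, phi33, ratio):
    """Right-hand side of (39), intended for t >> 1."""
    return ((1.0 - phi33) ** 2 * t ** 3 / 3.0
            + phi33 * (1.0 - phi33) * t ** 2
            + phi33 ** 2 * t
            + ratio)


def velocity_error_variance(t, phi33, a, b):
    """(2,2) element of cov x~(t|t): b^2 / C1(t)."""
    return b * b / c1_exact(t, phi33, (b * b) / (a * a))


# Illustrative values (assumptions).
exact = c1_exact(1000, 0.5, 1.0)
approx = c1_approx(1000, 0.5, 1.0)
relative_gap = abs(exact - approx) / exact
```

Because $C_1(t)$ grows like $t^3$, the velocity error variance $b^2/C_1(t)$ falls off like $t^{-3}$, which is the straight-line-fitting behavior the text alludes to.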

As a second check on the reasonableness of the results given,
observe that the case $t \gg 1$ is essentially the same as prediction
based on continuous observations. Setting $\phi_{33} = 0$, we have

$$E\tilde{x}_1^2(t|t) \approx \frac{a^2 b^2 t^2}{b^2 + a^2 t^3/3} \quad (t \gg 1;\ \phi_{33} = 0)$$

which is identical with the result obtained by Shinbrot [11], 
Example 1, and Solodovnikov [14], Example 2, in their treat- 
ment of the Wiener problem in the finite-length, continuous-data 
case, using an approach entirely different from ours. 


Conclusions 


This paper formulates and solves the Wiener problem from the 
“state’’ point of view. On the one hand, this leads to a very gen- 
eral treatment including cases which cause difficulties when at- 
tacked by other methods. On the other hand, the Wiener prob- 
lem is shown to be closely connected with other problems in the 
theory of control. Much remains to be done to exploit these 
connections.

APPENDIX
RANDOM PROCESSES: BASIC CONCEPTS 


For convenience of the reader, we review here some elementary 
definitions and facts about probability and random processes. 
Everything is presented with the utmost possible simplicity; 
for greater depth and breadth, consult Laning and Battin [5] or 
Doob [15]. 

A random variable is a function whose values depend on the
outcome of a chance event. The values of a random variable may be
any convenient mathematical entities; real or complex numbers,
vectors, etc. For simplicity, we shall consider here only
real-valued random variables, but this is no real restriction. Random
variables will be denoted by x, y, ... and their values by
$\xi$, $\eta$, .... Sums, products, and functions of random variables
are also random variables.

A random variable x can be explicitly defined by stating the
probability that x is less than or equal to some real constant $\xi$.
This is expressed symbolically by writing

$$\Pr(x \le \xi) = F_x(\xi); \quad F_x(-\infty) = 0, \quad F_x(+\infty) = 1$$

$F_x(\xi)$ is called the probability distribution function of the random
variable x. When $F_x(\xi)$ is differentiable with respect to $\xi$, then
$f_x(\xi) = dF_x(\xi)/d\xi$ is called the probability density function of x.

The expected value (mathematical expectation, statistical average,
ensemble average, mean, etc., are commonly used synonyms) of
any nonrandom function g(x) of a random variable x is defined by

$$Eg(x) = E[g(x)] = \int_{-\infty}^{\infty} g(\xi)\,dF_x(\xi) = \int_{-\infty}^{\infty} g(\xi)f_x(\xi)\,d\xi \tag{40}$$

As indicated, it is often convenient to omit the brackets after
the symbol E. A sequence of random variables (finite or infinite)

$$\{x(t)\} = \ldots, x(-1), x(0), x(1), \ldots \tag{41}$$

is called a discrete (or discrete-parameter) random (or stochastic)
process. One particular set of observed values of the random
process (41)

$$\ldots, \xi(-1), \xi(0), \xi(1), \ldots$$

is called a realization (or a sample function) of the process.
Intuitively, a random process is simply a set of random variables
which are indexed in such a way as to bring the notion of time into
the picture.
A random process is uncorrelated if

$$Ex(t)x(s) = Ex(t) \cdot Ex(s) \quad (t \ne s)$$

If, furthermore,

$$Ex(t)x(s) = 0 \quad (t \ne s)$$

then the random process is orthogonal. Any uncorrelated random
process can be changed into an orthogonal random process by
replacing x(t) by $x'(t) = x(t) - Ex(t)$ since then

$$\begin{aligned}
Ex'(t)x'(s) &= E[x(t) - Ex(t)] \cdot [x(s) - Ex(s)] \\
&= Ex(t)x(s) - Ex(t) \cdot Ex(s) = 0
\end{aligned}$$

It is useful to remember that, if a random process is orthogonal,
then

$$E[x(t_1) + x(t_2) + \ldots]^2 = Ex^2(t_1) + Ex^2(t_2) + \ldots \quad (t_1 \ne t_2 \ne \ldots)$$
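
These identities can be confirmed exactly on a small discrete example. The sketch below assumes a two-element "process" made of two independent fair dice (an illustrative choice, not from the paper) and uses exact rational arithmetic. It checks that the pair is uncorrelated, that subtracting the means makes it orthogonal, and that the mean square of the sum of the centered variables equals the sum of their mean squares.

```python
from fractions import Fraction
from itertools import product

# Hypothetical process: x(1), x(2) are two independent fair dice.
faces = [1, 2, 3, 4, 5, 6]
prob = Fraction(1, 36)            # probability of each (x1, x2) pair


def expect(g):
    """Exact expected value of g(x1, x2) over the 36 equally likely pairs."""
    return sum(prob * g(x1, x2) for x1, x2 in product(faces, repeat=2))


mean_1 = expect(lambda x1, x2: x1)
mean_2 = expect(lambda x1, x2: x2)

# Uncorrelated: E x(1) x(2) = E x(1) * E x(2)
e_product = expect(lambda x1, x2: x1 * x2)

# Orthogonal after centering: E x'(1) x'(2) = 0
e_centered = expect(lambda x1, x2: (x1 - mean_1) * (x2 - mean_2))

# Mean square of the sum equals the sum of mean squares.
sum_sq = expect(lambda x1, x2: ((x1 - mean_1) + (x2 - mean_2)) ** 2)
sq_1 = expect(lambda x1, x2: (x1 - mean_1) ** 2)
sq_2 = expect(lambda x1, x2: (x2 - mean_2) ** 2)
```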


If x is a vector-valued random variable with components
$x_1, \ldots, x_n$ (which are of course random variables), the matrix

$$[E(x_i - Ex_i)(x_j - Ex_j)] = E(x - Ex)(x' - Ex') = \operatorname{cov} x \tag{42}$$

is called the covariance matrix of x.

A random process may be specified explicitly by stating the
probability of simultaneous occurrence of any finite number of
events of the type

$$\Pr[x(t_1) \le \xi_1, \ldots, x(t_n) \le \xi_n] = F_{x(t_1), \ldots, x(t_n)}(\xi_1, \ldots, \xi_n) \tag{43}$$

where $F_{x(t_1), \ldots, x(t_n)}$ is called the joint probability
distribution function of the random variables $x(t_1), \ldots, x(t_n)$.
The joint probability density function is then

$$f_{x(t_1), \ldots, x(t_n)}(\xi_1, \ldots, \xi_n) = \frac{\partial^n F_{x(t_1), \ldots, x(t_n)}}{\partial \xi_1 \cdots \partial \xi_n}$$

provided the required derivatives exist. The expected value
$Eg[x(t_1), \ldots, x(t_n)]$ of any nonrandom function of n random
variables is defined by an n-fold integral analogous to (40).

A random process is independent if for any finite
$t_1 \ne \ldots \ne t_n$, (43) is equal to the product of the first-order
distributions

$$\Pr[x(t_1) \le \xi_1] \cdots \Pr[x(t_n) \le \xi_n]$$


If a set of random variables is independent, then they are obvi- 
ously also uncorrelated. The converse is not true in general. For 
a set of more than 2 random variables to be independent, it is not 
sufficient that any pair of random variables be independent. 

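Frequently it is of interest to consider the probability
distribution of a random variable $x(t_{n+1})$ of a random process given
the actual values $\xi(t_1), \ldots, \xi(t_n)$ with which the random
variables $x(t_1), \ldots, x(t_n)$ have occurred. This is denoted by

$$\Pr[x(t_{n+1}) \le \xi_{n+1} \mid x(t_1) = \xi_1, \ldots, x(t_n) = \xi_n] = \frac{\displaystyle\int_{-\infty}^{\xi_{n+1}} f_{x(t_1), \ldots, x(t_{n+1})}(\xi_1, \ldots, \xi_{n+1})\,d\xi_{n+1}}{f_{x(t_1), \ldots, x(t_n)}(\xi_1, \ldots, \xi_n)} \tag{44}$$

which is called the conditional probability distribution function of
$x(t_{n+1})$ given $x(t_1), \ldots, x(t_n)$. The conditional expectation

$$E\{g[x(t_{n+1})] \mid x(t_1), \ldots, x(t_n)\}$$

is defined analogously to (40). The conditional expectation is a
random variable; it follows that

$$E[E\{g[x(t_{n+1})] \mid x(t_1), \ldots, x(t_n)\}] = E\{g[x(t_{n+1})]\}$$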


In all cases of interest in this paper, integrals of the type (40) 
or (44) need never be evaluated explicitly; only the concept of the 
expected value is needed. 

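A random variable x is gaussian (or normally distributed) if

$$f_x(\xi) = \frac{1}{[2\pi E(x - Ex)^2]^{1/2}} \exp\left[-\frac{1}{2}\,\frac{(\xi - Ex)^2}{E(x - Ex)^2}\right]$$

which is the well-known bell-shaped curve. Similarly, a random
vector x is gaussian if

$$f_x(\xi) = \frac{1}{(2\pi)^{n/2}(\det C)^{1/2}} \exp\left[-\frac{1}{2}(\xi - Ex)'C^{-1}(\xi - Ex)\right]$$

where $C^{-1}$ is the inverse of the covariance matrix (42) of x. A
gaussian random process is defined similarly.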

The importance of gaussian random variables and processes is 
largely due to the following facts: 

Theorem 5. (A) Linear functions (and therefore conditional ex- 
pectations) on a gaussian random process are gaussian random 
variables. 

(B) Orthogonal gaussian random variables are independent. 

(C) Given any random process with means Ex(t) and covariances 
Ex(t)x(s), there exists a unique gaussian random process with the 
same means and covariances.

Shannon’s Theory and Feedback Systems


Feedback can be used as a means of realizing Shannon-predicted errorless capacity of a 
noisy communication channel. On the other hand, Shannon's information theory can 
be used as a guide for rating feedback control systems as well as selecting system com- 
ponents. A brief summary of the former aspect and the results from some preliminary 


investigations of the latter aspect are presented. Included topics are calculations of re- 
quired information capacities of control systems from input characteristics and fidelity 
requirements, required information capacities of system components, and calculation 
of information capacities of system components from saturation limits, threshold levels, 


INFORMATION THEORY is related to feedback systems in two ways:


1 Use of feedback to improve communication reliability. 
2 Use of information theory concept to determine necessary 
requirements for control-system components. 


The first item has been investigated by many researchers in- 
cluding Shannon himself. One significant result is that, with feed- 
back, Shannon’s coding theorem for a noisy channel can be 
realized with a much shorter coding length. 

The second item is a much greener area. Research work on 
nonlinear control systems in the past few years has been con- 
centrated on large signal nonlinearities, e.g., saturation, while 
studies on small signal nonlinearities are relatively limited in 
scope. The information theoretical concept can be summarized
as, “In order to achieve desired resolution and speed of response,
every component in the system has to have its required information
capacity.” The information capacity of components and
systems is formulated. These capacities may be useful criteria 
in the merit rating of systems as well as in the selection of com- 
ponents. 


I Feedback as a Means of Improving Communication Reliability

Shannon [1]¹ has shown that associated with each noisy
communication channel, there is a definite channel capacity.
Messages can be sent as error free as desired if they are transmitted
at a rate smaller than the channel capacity, but not at a rate larger
than the latter. What we mean by “error free” is not that the
received signal is a replica of the transmitted signal, which is
impossible with a noisy channel, but rather that by sending a
coded version of the message, there is overwhelming probability
that the received signal is among the ones which are decoded into
the original message according to the prescribed manner of
decoding.

While Shannon’s theory is sound, there is no practical method
of realizing his rather desirable predictions. One difficulty lies
in the fact that, in order to make the probability of error
approach zero, the coding length has to approach infinity. This
difficulty is further accentuated by a second one: Shannon’s
theory is based on random codes, and there is no known
systematic code which comes near the channel capacity Shannon
predicted. For the benefit of uninitiated readers we may explain
the difference between random codes and systematic codes by the
following example: The conversion from decimal number to
binary number and vice versa is a systematic code, as it can be
specified by a simple set of rules. The telephone directory is a
random code, since there is no way of associating a name with a
telephone number except through the telephone book. From an
engineering standpoint, it is a relatively simple matter to put a
systematic coding and decoding scheme into hardware. However,
with random coding, the required memory units increase
exponentially with coding length, and the system soon becomes
impractical.

¹ Numbers in brackets designate References at end of paper.
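
The contrast between the two kinds of code can be made concrete
with a short sketch. It is an illustration only and not part of the
original analysis: the function names, the choice of k message bits
per block, and the code word width of 2k bits are all assumptions
introduced here. The systematic code needs no storage because it is
a rule, whereas the random code book must hold one entry per
message, so its size doubles with every added message bit.

```python
import random


def systematic_encode(n: int, width: int) -> str:
    """Decimal-to-binary conversion: a code fully specified by a rule."""
    return format(n, f"0{width}b")


def systematic_decode(word: str) -> int:
    """Inverse rule: binary string back to the decimal number."""
    return int(word, 2)


def make_random_codebook(k: int, width: int, seed: int = 0) -> dict:
    """Assign each of the 2**k messages a distinct, randomly drawn code word.

    The only way to encode or decode is to look the entry up, so the
    whole table has to be stored (the "telephone directory" case).
    """
    rng = random.Random(seed)
    words = rng.sample(range(2 ** width), 2 ** k)
    return {msg: format(w, f"0{width}b") for msg, w in enumerate(words)}


# Hypothetical block sizes, chosen only to show the growth of the table.
for k in (4, 8, 12):
    book = make_random_codebook(k, width=2 * k)
    print(f"k = {k:2d} message bits -> {len(book)} stored code words")
```

Under these assumptions the rule-based coder has constant size,
while the table grows from 16 to 4096 entries as k goes from 4 to 12.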

Whenever a separate return channel is available, it can be
used to feed back some sort of information about the received
signal to the sender to improve communication reliability. For
instance, when A is talking to B over the telephone, B may ask A
to repeat an unclear sentence, or A may ask B to repeat an
important number or sentence, so that A may be sure that B has
received the message correctly. This practice is so well
established wherever human beings are present in a communication
link that we think nothing of it. In a totally mechanized
data-gathering, transmission, and analysis system without any
intervention of human intelligence, however, feedback has to be
properly mechanized if the same principle of improving
communication reliability is to be realized. The ultimate purpose
of studying feedback in a communication system is to accomplish
this in the most efficient manner.


Fig. 1 Block diagram of elementary communication systems (blocks
shown: sender coder, forward channel, receiver decoder, feedback
channel)


A block-diagram description of an elementary communication
system is given in Fig. 1. The solid lines represent a system
without feedback. The messages are coded in such a way that there is
least likelihood of confusion of two messages in passing through 
the communication channel. At the receiving end the received 
signal is decoded into the most likely transmitted signal based on 
a priori knowledge of all possible transmitted signals and the 
later knowledge of received signal. Since the coder has no idea 
of what is being received, it simply transmits one or the other 
among a set of preagreed signals. 

The broken lines in Fig. 1 represent introduction of feedback. 
The coder sends out digits not only according to which message 
is being sent but also according to its knowledge of the signal being
received. The coder and decoder can be rather simply 
constructed. For instance, the coder may be a device which con- 
tinues to transmit if the feedback signal indicates correct recep- 
tion, but sends out an erasure signal and a repetition of the pre- 
vious signal whenever a wrongly received signal is indicated. 
Alternatively, the feedback channel may transmit simply a 
request for repetition whenever the received message is am- 
biguous. The former arrangement is called information feed- 
back, and is designed to give most reliable transmission if a low 
noise feedback channel of sufficient capacity is available. The 
latter arrangement is called decision feedback. It is used if the 
capacity of the feedback channel is low, or if the noise in the 
feedback channel is such that considerable repetition is required 
to obtain reliable feedback transmission. The decoder in both 
cases simply follows through corrective instructions. More
elaborate coding and decoding schemes in a system with
feedback have also been investigated.
Previous investigations have established the following: 


1 With feedback, errorless or almost errorless transmission at 
a rate equal or close to the forward channel capacity is possible 
with virtually no coding or very little coding [2 to 7]. 

2 When the coding length is limited in both types of systems, the
sum total of the channel capacities required for both the forward
and feedback transmissions is less than that of a unidirectional
channel alone using coding only to achieve the same message
reliability [8].

3 However, no matter how perfect the feedback transmission
is and how elaborate the coding and feedback scheme is, it is not
possible to transmit almost error-free information at a rate
greater than the channel capacity of the forward channel,
calculated on the basis of no feedback [9, 10].


In other words, feedback is an effective and practical means of 
realizing the almost error-free capacity predicted by Shannon. 
It is also an effective means for two-way communication. 
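
A small simulation may help to illustrate the first and third
findings. It is a sketch under assumptions that are not taken from
the paper: the forward channel is modeled as a binary erasure
channel with erasure probability p (whose capacity is known to be
1 - p bits per use), the feedback channel is noiseless, and the
scheme is plain decision feedback, in which the receiver requests a
repetition whenever a symbol is erased. The function name and the
parameter values are hypothetical.

```python
import random


def send_with_decision_feedback(message, erasure_prob, seed=0):
    """Send each symbol until it arrives; return (received, channel_uses).

    Assumed model: every forward-channel use is erased independently with
    probability erasure_prob, and the repeat request sent back over the
    feedback channel is never lost.
    """
    rng = random.Random(seed)
    received = []
    channel_uses = 0
    for symbol in message:
        while True:
            channel_uses += 1
            if rng.random() >= erasure_prob:
                # Symbol got through; no repeat request is sent.
                received.append(symbol)
                break
            # Erasure: the receiver asks for a repetition and we loop.
    return received, channel_uses


rng = random.Random(1)
message = [rng.randint(0, 1) for _ in range(20000)]
p = 0.3  # hypothetical erasure probability
received, uses = send_with_decision_feedback(message, p)
rate = len(message) / uses
print(f"error free: {received == message}")
print(f"rate = {rate:.3f} bits per use, capacity = {1 - p:.3f}")
```

In this model the message arrives without error and with no coding
at all, at a rate close to, but not above, the forward channel
capacity.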

Investigations on the comparative costs of the logical circuitry 
for the feedback and unidirectional systems also greatly favor 
the feedback system [11]. 


II Information Flow Criteria in Control Systems


A control system can be regarded as a special case of a com- 
munication channel in which the desired output bears a simple 
operational relationship with the input. As such it is required to 
transmit information at the rate of the desired output which takes
the place of “signal” in communication problems, and one of the
necessary properties of the system components is that they must
have adequate capacity for transmitting this information. We
shall refer to the desired output as the signal. A preliminary
study on the following facets of the problem will be given:

(a) Required system capacity from given signal properties and
allowed error.

(b) Required capacity of system components.

(c) Information capacity of system components.


Required System Capacity From a Description of Input and Allowed
Error. The allowed error is a necessary concept in studying
the capacity required for transmitting a continuous signal. It 
may be considered as a sort of quantization of the continuous 
signal into discrete levels, and the information rate is a measure 
of the selection or choice the signal may make upon the quantized 
levels. Shannon defines the information rate of a continuous 
source relative to a given fidelity criterion as the least channel 
capacity required to transmit the signal within the fidelity re- 
quirements. We shall assume the fidelity requirement to be an 
allowed maximum mean square error in the following evaluations 
of information rates: 
1 Random Signal With Mean Power and Bandwidth Limitations. If the
mean square values of the signal and allowed error are S and E,
respectively, and the signal is limited to bandwidth W, the
information rate is at most [12]


R = W log (S/E) (1) 


The base of logarithm determines the unit in information rate. 
If 2 is used as base, the information rate is in bits per second. 
Throughout the present paper we shall use natural logarithm to 
facilitate derivations of equations. Its conversion into bits can 
be readily made by division with log 2. 

We mean by “at most’’ that Equation (1) is derived on the 
basis of mean power and bandwidth limitations only. Specifying 
the signal power spectrum (which is equivalent to specifying the 
correlation of successive signal samples) or specifying the ampli- 
tude distribution, or phase coherence generally has the effect of 
reducing R. However, in cases where further knowledge about 
the signal is lacking, the rate as expressed in Equation (1) must 
be met. 

Equation (1) was derived by Shannon. Heuristically, it is easy
to understand. From the sampling theorem, the number of
independent signal samples per second is 2W. The number of
probable signal levels is $\sqrt{S/E}$. Therefore

$$R = 2W \log \sqrt{S/E} = W \log (S/E)$$

Equation (1) is not valid if S/E is too low. In Shannon’s
derivation it was assumed that, knowing the received signal, the
conditional distribution of the transmitted signal is the same as the
error distribution. While this is approximately true if S/E is
large, it does not hold if S < E. Unrestrained use of Equation
(1) leads to a negative rate in case S < E.

A possible alternative is to assume that the signal and error are
uncorrelated. It follows that [13]

$$R = W \log \left[ \frac{S + E}{E} \right] \tag{1a}$$

While this expression eliminates the absurdity of negative rate, it
introduces a difficulty of its own. If the allowed error power is
larger than the signal power, obviously no transmission at all is
required; but Equation (1a) gives a positive rate rather than the
expected zero. The difficulty arises from the assumption that
signal and error are uncorrelated, as it breaks down at low signal
level.

Instead of Equation (1a), the following approximate form will 
be used: 


$$R = \begin{cases} W \log \dfrac{S}{E}, & \text{for } S > E \\ 0, & \text{for } S \le E \end{cases} \tag{2}$$

Equation (2) is continuous, and is correct for the case of large S
as well as S < E.
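
The behavior of the three expressions can be compared numerically.
The sketch below is illustrative only; it takes Equation (1) as
R = W log (S/E), Equation (1a) as R = W log [(S + E)/E], and
Equation (2) as Equation (1) clamped to zero for S not exceeding E.
The function names and the sample values of W, S, and E are
assumptions introduced here. Rates are in natural units per second
and are converted to bits by division with log 2, as stated in the
text.

```python
import math


def rate_eq1(W, S, E):
    """Equation (1): goes negative when S < E."""
    return W * math.log(S / E)


def rate_eq1a(W, S, E):
    """Equation (1a): stays positive even when S < E."""
    return W * math.log((S + E) / E)


def rate_eq2(W, S, E):
    """Equation (2): Equation (1) for S > E, zero otherwise."""
    return W * math.log(S / E) if S > E else 0.0


def nats_to_bits(rate):
    """Convert a rate computed with natural logarithms into bits."""
    return rate / math.log(2)


W, E = 10.0, 1.0  # hypothetical bandwidth (cps) and allowed error
for S in (100.0, 1.0, 0.5):
    print(
        f"S/E = {S / E:6.1f}:",
        f"(1) {nats_to_bits(rate_eq1(W, S, E)):8.2f}",
        f"(1a) {nats_to_bits(rate_eq1a(W, S, E)):8.2f}",
        f"(2) {nats_to_bits(rate_eq2(W, S, E)):8.2f}  bits/sec",
    )
```

For large S/E the three values nearly coincide; for S < E only
Equation (2) gives the expected zero.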

Sometimes the signal power spectrum $\Phi_s(f)$ is given and is not
uniform within W. The signal is still random in the sense that
its various sinusoidal components are uncorrelated. Assuming
that $\Phi_s(f) > \Phi_e(f)$ for all $f < W$, where $\Phi_e(f)$ is the
power spectrum of the error, Equation (2) can be modified to:

$$R = \int_0^W \log \frac{\Phi_s(f)}{\Phi_e(f)}\, df \tag{3}$$

The mean-square-error fidelity criterion does not determine
$\Phi_e(f)$ completely. It merely requires that

$$\int_0^W \Phi_e(f)\, df \le E \tag{4}$$
Different distributions of $\Phi_e(f)$ satisfying Inequality (4)
generally give different values of R. The least R with a
$\Phi_e(f)$ function satisfying the fidelity criterion of Inequality
(4) is the information rate of the signal according to Shannon’s
definition. To determine the least R, Lagrange’s method of
undetermined multipliers is used:

$$\delta \left[ \int_0^W \log \frac{\Phi_s(f)}{\Phi_e(f)}\, df + \lambda \int_0^W \Phi_e(f)\, df \right] = 0$$

The variation is taken on the function $\Phi_e(f)$, and it gives

$$\int_0^W \left[ \lambda - \frac{1}{\Phi_e(f)} \right] \delta\Phi_e(f)\, df = 0$$

Therefore

$$\Phi_e(f) = \frac{1}{\lambda} = \frac{E}{W} \tag{5}$$

Equation (3) becomes

$$R = \int_0^W \log \frac{W \Phi_s(f)}{E}\, df \tag{6}$$

In practice, the errors of successive output samples are not
correlated, which is equivalent to Equation (5). Therefore the
minimum required rate is generally adequate.


Fig. 2 Power-density spectrum as a function of frequency


2 Random Signal With Limited Total Power but Without Clearcut Limit on
Bandwidth. Referring to Fig. 2, $\Phi_s(f)$ approaches 0 as f
approaches infinity so that the total signal power is finite. The
problem is to find a channel with minimum capacity which can convey
this signal within given mean square error limits. For any f let
the function $I[\sigma, \Phi_s(f)]$ be defined as the smaller of the
two: a constant $\sigma$ or $\Phi_s(f)$. Given allowed mean square
error E, $\sigma$ is determined by

$$\int_0^\infty I[\sigma, \Phi_s(f)]\, df = E \tag{7}$$

The minimum-capacity channel has no transmission at frequencies
for which $\Phi_s(f) < \sigma$, and transmits the signal with mean
square error $\sigma$ at frequencies for which $\Phi_s(f) > \sigma$.
This can be proved either by known inequalities, or by a derivation
similar to the one that leads to (6) plus some additional arguments.

The capacity of the minimum channel is the information rate
by definition:

$$R = \int_0^\infty \log \frac{\Phi_s(f)}{I[\sigma, \Phi_s(f)]}\, df \tag{8}$$

Example 1. Given

$$\Phi_s(f) = A/f^2$$

and allowed mean square error E. Determine the information
rate.
Solution. Since $\Phi_s(f)$ decreases with f, there is a W beyond
which $\Phi_s(f) < \sigma$ and

$$\sigma = \Phi_s(W) = A/W^2$$

Equations (7) and (8) give

$$E = \int_0^W \sigma\, df + \int_W^\infty \frac{A}{f^2}\, df = \frac{2A}{W}$$

$$R = \int_0^W \log \frac{A}{\sigma f^2}\, df = -2 \int_0^W \log \frac{f}{W}\, df = 2W$$

Eliminating W between these equations

$$R = \frac{4A}{E} \tag{9}$$
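
The result of Example 1 can be checked numerically. The sketch
below is not from the paper: it assumes Equation (7) in the form
E = ∫ I[σ, Φ(f)] df and Equation (8) in the form
R = ∫ log {Φ(f) / I[σ, Φ(f)]} df, evaluates both by the midpoint
rule on a finite interval, and compares R with 4A/E. The function
name, the truncation frequency f_max, and the number of steps are
assumptions chosen for illustration, and the truncation makes the
agreement approximate.

```python
import math


def error_and_rate(phi, sigma, f_max, n):
    """Midpoint-rule evaluation of Equations (7) and (8) on [0, f_max].

    phi   : signal power-density spectrum, a function of frequency
    sigma : the constant level in I[sigma, phi(f)] = min(sigma, phi(f))
    Returns (E, R) with R in natural units per second.
    """
    h = f_max / n
    err = 0.0
    rate = 0.0
    for i in range(n):
        f = (i + 0.5) * h
        p = phi(f)
        floor = min(sigma, p)            # I[sigma, phi(f)]
        err += floor * h                 # integrand of Equation (7)
        rate += math.log(p / floor) * h  # zero wherever phi(f) < sigma
    return err, rate


A = 1.0             # hypothetical spectrum constant
W = 1.0             # frequency at which phi(f) drops to sigma
sigma = A / W ** 2
E, R = error_and_rate(lambda f: A / f ** 2, sigma, f_max=200.0, n=400_000)
print(f"E = {E:.4f}  (2A/W = {2 * A / W:.4f})")
print(f"R = {R:.4f}  (2W = {2 * W:.4f}, 4A/E = {4 * A / E:.4f})")
```

The same routine can be applied to other spectra by passing a
different phi and adjusting sigma until the computed E matches the
allowed error.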
3 Random Inputs of Signal Mixed With Noise. When noise is
inseparably mixed with signal, the required channel capacity of the
control system is still the needed capacity for transmitting the
desired output. However, when an input signal is masked by
noise, its information content $R_{in}$ is

$$R_{in} \le \int_0^\infty \log \left[ 1 + \frac{\Phi_s(f)}{\Phi_n(f)} \right] df \tag{10}$$

where $\Phi_s(f)$ and $\Phi_n(f)$ are power spectra of the input
signal and noise, respectively, and this value must be greater than
the rate $R_o$ calculated from the desired output signal and allowed
error:

$$R_{in} > R_o \tag{11}$$

Unless the foregoing inequality holds, there just is not sufficient
information in the input to give the desired output accurately
enough. No control system is capable of doing the job.

In a number of control problems, the noise is usually introduced 
through the sensing instruments. Inequality (11) may be con- 
sidered a necessary condition on the sensing instruments, because, 
if the sensing instruments are to be any good at all for the ap- 
plication, inequality (11) must be satisfied. 

Equation (10) is derived on the basis of uncorrelated random 
signal and noise with Gaussian distribution. Generally the noise 
is uncorrelated with the input signal and is approximately 
Gaussian distributed. However the input signal is usually 
not Gaussian, and R;, is less than the integral on the right-hand 
side of Equation (10). 
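
As a rough illustration of how this condition might be used to
screen a sensing instrument, the sketch below evaluates the
right-hand side of Equation (10), taken here as
∫ log [1 + Φs(f)/Φn(f)] df, and compares it with a required output
rate. The function names, the flat spectra, and the numerical values
are assumptions made for the example. Because the integral is only
an upper bound on the input information content, passing the check
is necessary but does not guarantee that the instrument is adequate.

```python
import math


def input_information_bound(phi_s, phi_n, f_max, n=10_000):
    """Midpoint-rule value of the integral in Equation (10) over [0, f_max].

    phi_s, phi_n : power spectra of input signal and noise (functions of f)
    Returns the bound in natural units per second.
    """
    h = f_max / n
    total = 0.0
    for i in range(n):
        f = (i + 0.5) * h
        total += math.log(1.0 + phi_s(f) / phi_n(f)) * h
    return total


def passes_necessary_check(bound, required_rate):
    """True if the upper bound on the input rate exceeds the required rate."""
    return bound > required_rate


# Hypothetical case: signal power 100 and noise power 1, both spread
# uniformly over a band W = 10; allowed mean square error E = 2.
W, S, N, E = 10.0, 100.0, 1.0, 2.0
bound = input_information_bound(lambda f: S / W, lambda f: N / W, f_max=W)
required = W * math.log(S / E)  # output rate from Equation (2), S > E
print(f"bound on input rate = {bound:.2f}, required = {required:.2f}")
print("necessary condition met:", passes_necessary_check(bound, required))
```

With flat spectra the bound reduces to W log (1 + S/N), which gives
a quick hand check of the routine.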

4 Quasi-Determined Signal. One frequently used description of
reference input signals in control systems is “steplike” or
“ramplike.” These signals are somewhere in between “deterministic”
and “random,” and have lower information rates than their
random counterparts with the same power spectra. Their
information rates can be estimated by quantizations in time and
amplitude as illustrated in the following example.

Example 2. A steplike signal has on the average $\nu$ step inputs
per second. The occurrences as well as amplitude distribution of
these step inputs are completely independent of each other. The
amplitude of each step input is Gaussian distributed with mean
square value $a^2$. Estimate the information rate subject to a
mean square error E.

Solution. A close upper bound of the foregoing rate can be
obtained as follows: Let the time and amplitude co-ordinates be
quantized into segments of lengths $\tau$ and b, respectively. We
shall determine R and E in terms of $\tau$ and b, and then minimize E
under the constraint of constant R to obtain optimum values of
$\tau$ and b.


vT 1/6\* 
The two terms on the right-hand side of Equation (12) give
mean-square errors due to uncertainties in time of occurrence and
amplitude, respectively.

There are two components of the information rate: $R_1$ is
required to convey whether a step input occurs in time interval
$\tau$ or not, and $R_2$ is required to convey the change of
amplitude from level $n_1$ to level $n_2$. Therefore


$$R_1 = -\frac{1}{\tau} \left[ \nu\tau \log \nu\tau + (1 - \nu\tau) \log (1 - \nu\tau) \right] \tag{13}$$

Generally, close followup of the signal is required, which implies
that $\nu\tau \ll 1$. Expanding $(1 - \nu\tau) \log (1 - \nu\tau)$
into a power series of $\nu\tau$ and neglecting terms of order
$(\nu\tau)^2$ we obtain

$$(1 - \nu\tau) \log (1 - \nu\tau) \cong -\nu\tau$$

Therefore Equation (13) becomes

$$R_1 \cong -\frac{1}{\tau} (\nu\tau \log \nu\tau - \nu\tau) = \nu (1 - \log \nu\tau) \tag{14}$$
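
The quality of this approximation is easy to examine numerically.
The sketch below is illustrative and not part of the original
derivation; it takes Equation (13) as
R1 = -(1/τ)[ντ log ντ + (1 - ντ) log (1 - ντ)] and Equation (14) as
R1 ≅ ν(1 - log ντ). The function names and the sample values of ν
and τ are assumptions.

```python
import math


def r1_exact(nu, tau):
    """Equation (13): rate needed to signal step occurrence per interval."""
    x = nu * tau
    return -(x * math.log(x) + (1.0 - x) * math.log(1.0 - x)) / tau


def r1_approx(nu, tau):
    """Equation (14): first-order approximation valid for nu * tau << 1."""
    return nu * (1.0 - math.log(nu * tau))


nu = 1.0  # hypothetical average number of steps per second
for tau in (0.2, 0.05, 0.01, 0.001):
    exact = r1_exact(nu, tau)
    approx = r1_approx(nu, tau)
    rel = abs(exact - approx) / exact
    print(f"nu*tau = {nu * tau:6.3f}  exact = {exact:7.4f}  "
          f"approx = {approx:7.4f}  rel. diff = {rel:.2e}")
```

The relative difference shrinks as ντ decreases, in line with the
neglected terms being of order (ντ)².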


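Assuming that the signal amplitude were in level $n_1$, the
probability that it goes into level $n_2$ is, for $b \ll a$,

$$p(n_2, n_1) = \frac{b}{\sqrt{2\pi}\, a}\, e^{-(n_2 - n_1)^2 b^2 / 2a^2} \tag{15}$$

$$\begin{aligned}
\frac{R_2}{\nu} &= -\sum_{n_2} p(n_2, n_1) \log p(n_2, n_1) \\
&= \log (\sqrt{2\pi}\, a/b) \sum_{n_2} p(n_2, n_1) + \sum_{n_2} p(n_2, n_1)\, (n_2 - n_1)^2 b^2 / 2a^2 \\
&= \log (\sqrt{2\pi}\, a/b) + \frac{1}{2}
\end{aligned}$$

$$R_2 = \nu \log (\sqrt{2\pi e}\, a/b) \tag{16}$$

$$R = R_1 + R_2 = \nu \left[ 1 + \log \frac{\sqrt{2\pi e}\, a}{\nu\tau b} \right] \tag{17}$$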


For constant R, $\nu\tau b$ is a constant, say, C, and
$\nu\tau = C/b$. Equation (12) becomes
a’C 1/b\2 
E= (18) 


Letting dE/db = 0, we obtain 
vra* 2/b6\? 2 E 19 


Substituting Equation (19) into (17) we obtain: 


3 re a? 
R=y E +; log (s (20) 


The input signal described in Example 2 has a power-density 
spectrum of 
va? 


&(f) = mp 


If the signal is completely random, according to Equation (9) 


4va’ 


(21) 
Normally $a^2 \gg E$, and the rate calculated from Equation (21) is
much higher than that from Equation (20).

To illustrate some interesting implications, suppose that there 
are a continuous linear system and a relay-controlled system both 
of which give mean square error E in following up the steplike 
input. The linear system would give the same mean square error 
in following up any random input of the same power spectrum. 
Its required information capacity is at least the value given by 
Equation (21). However nothing can be said about what the 
relay system would do in case of random input, and its required 
capacity is given by Equation (20). To obtain equally good re- 
sponse to steplike inputs, the requirements on information ca- 
pacity of components are more stringent for the linear system 
than that for the relay system. However, the linear system is 
generally more versatile. 

Required Capacity of System Components. The required capacity
of system components can be estimated from the following rule:

“If the system output is completely determined by the output
variable of a certain component, the required capacity of the
component in question is at least as great as that of the system
itself.”

The question is still open for components which do not have 
complete control over the output variable. There are two possi- 
ble causes of this incompleteness: 


1 The system output is determined by some chance variable in 
addition to the output variable of the component, for instance, 
there may be load disturbance, or small signal nonlinearities like 
slot effect or backlash. While the latter effects are not strictly 
chance variations, their functional dependence on the input varia- 
ble is never utilized. 

2 The system output is determined by the output variable of 
the component together with some other controlling function. 


In case 1, the required capacity of the component is at least as 
great as that of the system itself, and is usually greater since any 
chance variation generally results in a loss of information. In 
case 2, the required capacity of the component may be smaller. 


Fig. 3 Block diagram of a typical control system 


Referring to Fig. 3, the components $G_1$, $G_2$, and $G_3$ should
have at least as much capacity as the system itself. Not counting any
chance variation


22) 


GA p)Gip) 
C(p) = Gi p)V = | 
(p) (p)Vo(p) | Vi(p) ( 


Therefore c(t) is completely determined by either of the output
variables $v_1(t)$ or $v_2(t)$.

For nonlinear systems, Equation (22) no longer holds; however,
the essence of the argument is still valid: c(t) is a function of
$v_2(t)$ only, and $v_2(t)$ is a function of $v_1(t)$ only.

The output of the feedback element $H_1$ does not determine
c(t) completely. However, in any practical system, the difference
between r and this feedback signal is not only small, but should be
a reflection of the difference between c and the desired output,
rather than some extraneous error introduced in the feedback
element. The feedback element is required to transmit c(t) with a
much smaller error than the system error, and consequently its
required capacity is considerably higher than that of the system
itself.

The requirement on the feedback element $H_2$ is less stringent.
Generally $H_2$ is used to stabilize the system by shaping the
transfer characteristics in the vicinity of the crossover frequency,
and the loop gain $G_2 H_2$ can be much less than unity at low
frequencies where the signal power spectrum is concentrated. As the
feedback element $H_2$ does not take an essential part in the
information transmission process, it is not required to have more
information capacity than that of the system.

Information Capacity of Control Components. The capacity of a
control component is determined by its saturation level, its
threshold level, and how fast a change in signal level can take
place. To facilitate our analysis, the components can be classified
as (1) directly limited and (2) indirectly limited. For directly
limited components, the saturation level L and threshold level
l are on the same signal. An example is the digital processing
unit in a sampled-data system: l is the value represented by a
“1” in the lowest digit, while L is given by the largest possible
number times l. Considering both positive and negative levels,

$$C = \frac{1}{T} \log \left( 1 + \frac{2L}{l} \right) \tag{23}$$

where C is the information capacity of the processing unit and T
is the sampling time. However, not all the capacity C is
utilizable. If the error signal is limited to bandwidth W, only 2W
sampling points are independent:

$$C_u = 2W \log \left( 1 + \frac{2L}{l} \right) \tag{24}$$

$C_u$ is always less than C, since W < 1/2T must hold in order to
avoid signal aliasing in a sampled-data system.

Other examples are control relays, synchro resolvers, various
types of transducers, and so on. In all cases, both capacities
C and $C_u$ exist and are calculable by Equations (23) and (24).
However, for continuous signal devices, the sampling period is not
as clearly defined.
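
A short calculation shows how the two capacities compare for a
directly limited component. The sketch is illustrative only; it
takes Equation (23) as C = (1/T) log (1 + 2L/l) and Equation (24)
as the same logarithm multiplied by 2W. The function names and the
example of a 12-bit signed processing unit sampled every
millisecond, driven by an error signal of 100 cps bandwidth, are
assumptions introduced here.

```python
import math


def capacity_sampled(L, l, T):
    """Equation (23): capacity of the unit at one sample every T seconds."""
    return math.log(1.0 + 2.0 * L / l) / T


def capacity_utilizable(L, l, W):
    """Equation (24): capacity actually usable for a signal of bandwidth W."""
    return 2.0 * W * math.log(1.0 + 2.0 * L / l)


# Hypothetical 12-bit signed unit: threshold l = 1 count, saturation
# L = 2047 counts, so that 1 + 2L/l = 4095 distinguishable levels.
L_sat, l_thr = 2047.0, 1.0
T = 1.0e-3   # sampling time, seconds
W = 100.0    # error-signal bandwidth, cps

C = capacity_sampled(L_sat, l_thr, T)
C_u = capacity_utilizable(L_sat, l_thr, W)
print(f"C   = {C / math.log(2):10.1f} bits/sec")
print(f"C_u = {C_u / math.log(2):10.1f} bits/sec")
print(f"fraction of capacity utilizable = {C_u / C:.2f}")
```

The ratio of the two capacities is 2WT, so the utilizable fraction
in this example is set entirely by how far the sampling rate exceeds
twice the signal bandwidth.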


Fig. 4 Block diagram representation of a control component with
indirect limiting


Indirect limiting is illustrated in Fig. 4. There is either
saturation or mean power limitation on the input signal i(t) and
quantization or uncertainty on the output signal a(t). Consider the
case of mean power limitation first:

$$\begin{aligned}
C &= W \log 2\pi e S - \int_0^W \log |G(j2\pi f)|^2\, df - W \log 2\pi e E \\
&= W \log \frac{S}{E} - \int_0^W \log |G(j2\pi f)|^2\, df
\end{aligned} \tag{25}$$

In Equation (25), S is the mean value of $[i(t)]^2$, and E is the
mean square error introduced in a(t) by the quantization or
uncertainty. Equation (25) is the direct result of an equation for
entropy loss in transmitting through a linear network derived by
Shannon.

If instead of mean power limitation, the input i(t) is limited by
saturation level L, and the resolution of the output variable a(t)
is limited by quantization level l, an approximate expression of C
can be derived from Equation (25). A uniform distribution of
i(t) gives maximum rate or capacity. Assuming this, we have

$$S = \frac{1}{3} L^2, \qquad E = \frac{1}{12} l^2$$

Equation (25) gives

$$C \cong 2W \log \frac{2L}{l} - \int_0^W \log |G(j2\pi f)|^2\, df \tag{26}$$

If $|G(j2\pi f)| = 1$, Equation (26) reduces approximately to (24).
The difference of 1 level in the logarithm arises out of the
approximate nature of the derivation.


This paper gives a brief summary of the results of various in- 
vestigations in applying feedback in communication systems for 
improving message reliability, and also presents a preliminary 
analysis of information flow in feedback control systems. There 
is a good possibility that further development along the latter 
lines may result in worthwhile criteria for both control systems 
and control components. 
Multipass Heat Exchangers


The transfer functions obtained by dynamic analysis of one shell
pass and 2, 3, 4, . . ., 2n, 2n + 1 tube-pass heat exchangers as a
distributed parameter system are presented in dimensionless forms.

The heat-exchange processes are found to be governed by the
third-order characteristic equations with complex coefficients, and
can be solved numerically using a graphical method.


The numerical examples are presented to show the clear difference of frequency response 
for such cases when no solid capacity exists. 

The analog computer and the experimental results are found to be in good agreement 
with the theoretical results. 


Introduction 


PURE PARALLEL AND COUNTERFLOW heat exchangers
have been analyzed exactly [1, 2]¹ and their controllability has
been extensively discussed in the literature [3, 4, 5]. For
multipass heat exchangers, many studies of the mean temperature
difference [6 to 10] and of steady temperature distributions [11]
have been published, but there have been few on dynamic analysis.

However, from the control engineer’s point of view, process 
dynamics is one of the most important subjects. 

The dynamic response of the system is dependent not only on 
the mean temperature difference, but also on other system 
parameters such as fluid velocities, heat capacities of the fluids 
and tube-walls, and so forth. The transient response seems quite
difficult to calculate, but the frequency response analysis is
simpler and is used throughout this paper.

The author extended P. Profos’s basic partial differential
equation [12] and analyzed exactly the dynamic characteristics of
1-2, 3, 4, . . ., 2n, 2n + 1 pass heat exchangers.

The third-order characteristic equations governing the
heat-exchange processes in each multipass heat exchanger, and the
transfer function formulas which involve the roots of the said
equation, are obtained and presented.

Numerical examples of frequency response of each case with no
solid capacity are also shown.


Basic Assumptions


1 Fluid velocities and heat-transfer coefficients are all constant
and do not change with the temperature of fluid or heat-exchange
surfaces.

2 Each cross section is constant.

3 Complete mixing in crosswise directions of each flow exists.

4 The heat conductivities of tube and shell walls are infinite
in the directions at right angles to the flow and zero in flow
directions.

5 There is no heat loss to outer circumferences.


Nomenclature


A = heat-exchange surface area, meter²
kA/q, a, = kA/q,, dimensionless 
hA/q, a,’ = hA/q, a,’ = h,A/q, dimensionless 
hA/(Cv), by = hA/(Cyp1), = hi A/(Cyn), 6, = h,A/ 
(C,m), dimensionless 
= series and side capacity, kcal/meter deg C 
total length of shell, meter 
= film coefficient of heat transfer, h, tube-side, h shell- 
side, h, side capacity-side, kcal/meter* min deg C 
over-all coefficient of heat transfer, kcal/meter*® 
min deg C 
dead time, H/v = L, H/v; = Ly, min 
Y/H, dimensionless 
= variable in auxiliary equation 
= Pw, = keal/min deg C 
v,/v = L/L), dimensionless 
jo, j = 
= time, min 
= peripheral length of heat-exchange surface, meter 
= variable (see Appendix) 


— 1, dimensionless 
variable (see Appendix)
fluid velocity, », tube-side, v shell-side, meter/min 
heat capacity of fluid per unit length, kcal/meter 
deg C 
running length of shell measured from shell input, 
meter 
As parameters defined by Equation (3) 
B:, Bs, Bs parameters defined by Equation (3) 
0 temperature of shell fluid in lumped system, deg C 
6 temperature of fluid, @ shell-side, @,, 02, 3, . . . tube- 
side, deg C 
T t/L,, dimensionless 
¢ temperature of solid, g, series capacity, ¢, side 
capacity, deg C 
w circular frequency, dimensionless 


Subscripts 


1,2,3,... = tube-side, no subscript = shell-side
i = input, o = output 
h = series capacity, s = side capacity 
The transfer function is taken to be the dynamic ratio of the
outlet temperature of one fluid to the inlet temperature of another
fluid. Since a sinusoidal temperature variation is superposed on
the steady-state temperature, this definition suffices for the
sinusoidal solutions considered here.


Fundamental Equations 


The following relation exists between a fluid at temperature 0 
and a solid surface at temperature g through which heat ex- 


change takes place: 
06 06 


Using nondimensional 7, J, and a’ gives 


06 
If the tube wall is infinitely thin, ¢ can be replaced by the 


temperature of another fluid, and a’ replaced by a. This funda- 
mental equation applies to 6, 6;, A, . . . in the following. 


1-2 Pass Heat Exchanger [13]
Case P-C 


The “P-C” flow pattern is shown in Fig. 1. In this case, the 
tube fluid flows first from left to right, turns around at the right 
end, and finally flows back to the left, resulting in a parallel flow 
followed by a counterflow. The shell fluid is always assumed to 
flow from the left end to the right end. Both inside and outside 
tube walls are assumed to be infinitely thin so that there will be 
neither series nor side capacity to be considered. 

Assuming equal dimensions and conditions for each tube pass 
and equal U for shell and tube fluid, the fundamental equations 
are 


08, 


— 9) + — 8) 


Gpc- at( = 


= Bs + (a2 — — a8; je”? + (as — 2B; 


The solution of Equations (1) for a sinusoidal input signal (e*’ ): 

= + + (2) 
= (Cre?! + Cre?! + Cre? 


where C, C2, C; are the integration constants, and a, 8 are: 


ay ay 
a; =~ 
(3) 
a, a, 
+s +a, Pots +a; 


Pi, P2, Ps are the three roots of 
p*? + p{2a + rs) — p(s + a)? 
— s(s + a){r(s + a) + 2a} = 0 (4) 


The roots are all complex (see Appendix). 
Shell-Side Input: The C,, C2, C; can be determined from the 
following boundary conditions [Fig. 2(a)]: 


0: 0 = 04 = BC; + BC. + BC, = 0 
is 6, = or B,Cye™ + 


= a, Cie”! + + asCye?* 


l 


l 


The substitution of the results thus obtained into Equations (2) 
and solving for 62/0, [=Gpc-s(s)] gives the transfer function. 


Fig. 2 Case P-C (a) shell-side input; (b) tube-side input


— Pr — + [as — Bs — (ar — + — Bi — (as — 


= 


— BiXBs — + (a2 — BrXBi — Bile” + (as — — 


If input and output are exchanged with each other [Fig. 2(6)], 
we obtain the following transfer function through a similar pro- 
cedure: 


— 


Fig. 1 1-2 pass heat exchanger showing symbolism (labels shown:
shell-side fluid, baffle plate, tube-side fluid)


Boje” + (a2 — Br — 


(6) 
+ (ax — — Bile” 


Static Characteristics: This is the case when s = 0 in Equa- 
tions (5), (6). For s = 0, from Equation (4), three roots are 


=0, =-at Vat + a,? 
which give 


aq, ) 
=1, a= | 
a, — a — Ps | 
ay a | 
= 1, - — 
a, + Bs a + Ds } 
Introducing these values into Equations (5), (6), and rearranging,
we obtain


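$$G_{PC-s}(0) = \frac{2a_1}{a_1 + a + \sqrt{J}\coth\sqrt{J}} \qquad (5a)$$

and

$$G_{PC-t}(0) = \frac{2a}{a_1 + a + \sqrt{J}\coth\sqrt{J}} \qquad (6a)$$

where

$$J = a^2 + a_1^2$$

Equations (5a), (6a) are the static characteristics known as the temperature efficiency of the heat exchanger [14].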


Case C-P: 


Counterflow at input side, parallel-flow at output side (see 


Fig. 3). The fundamental equations are as follows: 
26; 08; = a(0 — 
or ol 
06. 
or + — 6) (7) 
06 06 
Fig. 3 Case C-P (a) shell-side input, (b) tube-side input


By solving these equations, the third-order characteristic equa- 
tion identical to Equation (4) can be obtained. The tempera- 


tures are: 
= + + | 
6, = + + (8) 
= (Cye™! + Cre?! + Cre?™ 


/ 


where C;, C2, C; are the constants of integration, a, 8 can be ex- 
pressed by the same equation as (3). Through the same proce- 
dure as before, we get 


With Side Capacity. Assume that the side capacity (thermal 
capacity of shell wall) has infinite thermal resistance in the axial 
and no thermal resistance in the radial directions. 

In the case P-C (same as C-P), for example, the fundamental 
equations are (see Fig. 4) 


06, 
ar + al = — 
06 06. 


> 
28 
+ = — 8) + 8) + 8) 


or = b, (8 — ¢,) 


In this case, the characteristic equation (4) changes somewhat 
and takes the following form: 


's 


pt (20 + +) 


— p(s + a)? 
— a(s + a) + a,) + 2a + (s + (12) 
s+b, 


With Series and Side Capacity. Assume that the series (solid 
capacity of tube-wall) and side capacity have infinite thermal re- 
sistance in the axial and no resistance in the radial directions. In 


the case of P-C, for example, the fundamental equations are (see 
Fig. 5): 


rN = — | 
= — 62) | 
+ = — 0) + — 8) | 
+a,(¢,- (13) 
= — gm) + — gm) 
= — gas) + — | 
= — ¢,) | 


G. (s) = (a8, — XB: — at; + — 00233) By je? + Ps + — — 
ICP-st 


9 
a [B82 — az — (8; — as + — as (8; a Je" + — a — (B: — (9) 


Gcr-t(8) = 


0 
an (Bs - as) + — a; — — + as[ — a, — — Jem (10) 


The static characteristics are, from Equation (9),

$$G_{CP-s}(0) = \frac{2a_1}{a_1 + a + \sqrt{J}\coth\sqrt{J}}$$

and, from Equation (10),

$$G_{CP-t}(0) = \frac{2a}{a_1 + a + \sqrt{J}\coth\sqrt{J}}$$
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Fig. 5 With series and side capacity 


= 
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From these equations, the third-order characteristic equation 
with complex coefficients can be obtained. The result is: 


| 6. 
3 2 _4,'8 2a'(s | 6 | 
pit ro + +b; + by 


bi + by 
+ bi) ( 
[e+ 8+ 
ai(s + bi) 


and Equations (3) change to 


) 
sth +b 


+ 2a’ 


by 

s+ 

s+ + be 


+24 


The transfer functions may be obtained through a similar pro- 
cedure. 


1-3 Pass Heat Exchanger 
Case P-C-P: 


This is a parallel-counter-parallel flow heat exchange process. 
The fundamental equations are as follows, when no solid capacity 


Fig.7 Case C-P-C, 1-3 pass heat exchanger 


The temperature variations of each fluid when input fluid tem- 
perature change is sinusoidal (e*” ) are: 


6, = (B,Die™ + + + Age?” + faz — — (as — — 


where D,, D2, D3, Ay are the constants of integration, and Ps 
are the roots of 


= 1) 


p? + p(3a + rs) — p{(a + 8)? + aa} 


d 
= = — 43) 
= + 8) (19) 


The a, B are given by Equation (3). 
If shell input is considered, from the similar boundary condi- 


= — 0) + — 6) + a(@ — 8) 
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tions as 1-2 pass type, as shown in Fig. 6(b), the constants of in- 
tegration are determined and the transfer function Gpcp-(s) = 


exists: 6/8; is obtained: 
06, Grcr-st(8) = [(a28; — — — +P) 
= 
20. + (a8; — — — /A, (20) 
or al where 


265 20s _ — 0s) A; = e?(a, — B:)[2(B: — Bs) — (a2 — as)) 

or al + e?(a2 — B.)[2(8s — Bi) — (as — m)] 

+ — B:)[2(8: — B:) — (a1 — 
+ — + — 8) + — 8) + — Bs) + — B:) + — 


Changing input and output relations as in Fig. 6(c), we get 


= [far — B, — (a2 — (et — my 


= _ /A 22 
(a, Dye”! + + aD ( 7) + {as Bs (a Bi)}(e ( ) 
1 
Case C-P-C: 
6; = (B:Dye”" + + Age?" 
‘ This is the case of counter-parallel-counter flow heat exchange 
= (Dye?! +. Dye?! + Dye? 


process, see Fig. 7(a). The fundamental equations are: 


Transactions of the ASME. 


(21) 


Fig. 6 Case P-C-P, 1-3 pass heat exchanger, panels (a), (b), (c)

From which we get 
0, = + + + 
6, = + + 
= (aDye™ + + — 
= (Dye?! + + 


(24) 


where D,, D:, D;, As are the integration constants and p,, ps, ps are 
the three roots of 
$$p^3 + p^2(3a + rs) - p\{(a_1 + s)^2 - aa_1\} - s(s + a_1)\{r(a_1 + s) + 3a\} = 0 \qquad (25)$$


and 


P=at+s (26) 


a, B can be expressed by Equation (3) as before. 

Shell-Side Input: From the boundary conditions shown in Fig. 
7(b), we get the following transfer function through a similar 
procedure as before: 

Gorc-e(8) = [(2a3 — — — 
+ (2a; — — — 
+ (2a2 — — — (27) 


where 


Ay’ = — a28;) + 08; — 
+ e+ — + e™[e?(2a, — Bias: — Bs — a2 + Bs) 
+ e?(2a, — B2)( — a+ 
+ e?(2a; — — Bs — a: + 
Tube-Side Input: From the boundary conditions in Fig. 7(¢), we 
get the following result: 
Goro-w(s) = [fai — — (as — t Ps) 
+ {ae (as (emt Pe emt Psy 
+ fas — Bs — (a: — — (29) 


(28) 


1-4 Pass Heat Exchanger 
Case P-C-P-C: 


Assuming as before, the fundamental equations are: 


‘= a,(6 — 


= — 


= a(6, — 0) + — 80) + a(@; — 8) 
+ — 6) 
from which we get 

0; = (8,Eye™! + + + Agee” 

0, = + + + Bye?" 

6; = + — Age” 

= (a + aE e™ + — Bye™)e** 

= (Eye™ + + Eye?" 
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6. 
(c) 
Fig. 8 Case P-C-P-C, 1-4 pass heat exchanger 


Fig.9 Case C-P-C-P, 1-4 pass heat exchanger 


where $E_1$, $E_2$, $E_3$, $A_4$, $B_5$ are the constants of integration, $\alpha$, $\beta$ are obtained from Equation (3), and $p_1$, $p_2$, $p_3$ are the three roots of the following equation:

$$p^3 + p^2(4a + rs) - p(a_1 + s)^2 - s(s + a_1)\{r(a_1 + s) + 4a\} = 0 \qquad (32)$$

$$p_4 = -(a_1 + s) \qquad (33)$$

$$p_5 = a_1 + s \qquad (34)$$


Shell-Side Input: From the boundary conditions shown in Fig. 
8(b) we get the transfer function 
Grcrc-s(s) = 2(e* + — — )e™ 
+ (a2 — B:(ax8; — 
+ (a3; — Bs — (35) 
where 
Ay = e™((a, — a — a; — 28; Bs)}e” 
+ (a, — - & = 2(Bs - B;)}e™ 
+ (a; — B:){a, — as — 2B, — Bz)}e”* 
+ e™[(a, — 8:)(Bs — + (a2 — Bi — 
+ (as — — 


Tube-Side Input: See Fig. 8c). The transfer function is: 


Grcrc-te(8) = (e™ + e”)[{a2 — — (a; — 
+ {as (Q@ — 
+ B; — (a; — /A, (36) 
Case C-P-C-P 
The fundamental equations are, 


06; 


= = — 


26. 
= — G2) 
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06. 


(37) 


06 
= a( A, 6) + = 6) + a( Os 6) 


+ a(O, = 


The transfer functions can be obtained through the procedure 
described before: 


Shell-Side Input: See Fig. 9(b). 
= 2(e?* + e?*)[((as — — a8; 
+ (a — — Ps 
+ (az — — (38) 
Tube-Side Input: See Fig. 9(c). 


Gorcr-t(s) = (e? + e”)[fas — Bs — (a2 — B:)}e™ 
+ {ar — Bi — (as — Bs)}e™ 
+ faz — Be — (ay — (39) 
where pi, P2, ps are the roots of Equation (32); a, 8 are to be 
obtained from Equations (3); and ps, ps are given by Equations 
(33) and (34) respectively; and, 
A,’ = — Bs — (a2 — — B;)e”* 
+ fa: — — (as — B;)}(2a2 — Br)e™ 
+ fas — (a; — B;)}(2as — Bse™ 
+ e[aifas — Bs — (a2 — B2)}e™ 
+ afar — Bi — (as — Bs)}e™ 
+ — Bs — (a — B;)}e”"] (40) 


Cases With More Than Four Tube Passes 


The afore-mentioned procedures can be equally applied to the 
heat exchangers which have more than four tube passes. 

The procedure is rather complicated when the number of tube 
passes is odd. For example, let us consider the (2n + 1)-pass 
heat exchanger: 

The third-order characteristic equation is: 

For the types P-C-P

$$p^3 + p^2\{(2n + 1)a + rs\} - p\{(a_1 + s)^2 + aa_1\} - s(s + a_1)\{r(a_1 + s) + (2n + 1)a\} = 0 \qquad (41)$$

which corresponds to Equation (18). The remaining p-values are given by $n$, $n - 1$ repeated roots of $p_4$, $p_5$, respectively.

For the types C-P-C

$$p^3 + p^2\{(2n + 1)a + rs\} - p\{(a_1 + s)^2 - aa_1\} - s(s + a_1)\{r(a_1 + s) + (2n + 1)a\} = 0 \qquad (42)$$

which corresponds to Equation (25). The remaining p-values are given by $n - 1$, $n$ repeated roots of $p_4$, $p_5$, respectively.

For the (2n)-pass heat exchanger, the characteristic equation corresponding to Equations (4) and (32) is as follows for both types:

$$p^3 + p^2(2na + rs) - p(a_1 + s)^2 - s(s + a_1)\{r(s + a_1) + 2na\} = 0 \qquad (43)$$

The remaining p-values are given by $n - 1$ repeated roots of $p_4$, $p_5$.
Thus the dynamic response of these multipass heat exchangers 
can be obtained through the use of these roots. The order of the 


equations never exceeds three so long as all tube passes are identi- 
cal and the number of shell pass is one. 
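
As a rough illustration of how the cubic depends only on the number of tube passes and on the direction of the first pass, the following Python sketch assembles the coefficients of the characteristic equation. It assumes the reading of Equations (4), (41), (42), and (43) in which the linear term is $-p\{(a_1 + s)^2 \pm aa_1\}$ for an odd number of passes (plus for P-C-P types, minus for C-P-C types) and $-p(a_1 + s)^2$ for an even number. The function name `char_coeffs` and its arguments are hypothetical, introduced only for this example.

```python
def char_coeffs(n_pass, s, a, a1, r, first="P"):
    """Return (c3, c2, c1, c0) of c3*p**3 + c2*p**2 + c1*p + c0 = 0.

    n_pass : number of tube passes (2 or more), one shell pass
    s      : complex frequency, e.g. 2j for omega = 2
    a, a1  : shell-side (per pass) and tube-side coefficients
    r      : capacity-ratio parameter multiplying s in the shell equation
    first  : "P" if the first tube pass is parallel flow, "C" if counterflow
    """
    if n_pass < 2:
        raise ValueError("this sketch covers two or more tube passes")
    big_p = s + a1
    if n_pass % 2 == 0:
        extra = 0.0                      # even passes: Equations (4), (32), (43)
    elif first == "P":
        extra = a * a1                   # P-C-P types: Equation (41)
    else:
        extra = -a * a1                  # C-P-C types: Equation (42)
    c2 = n_pass * a + r * s
    c1 = -(big_p * big_p + extra)
    c0 = -s * big_p * (r * big_p + n_pass * a)
    return (1.0, c2, c1, c0)


# Two passes, r = 1, a = a1 = 1, omega = 2 (the case treated in the Appendix)
coeffs_two_pass = char_coeffs(2, 2j, 1.0, 1.0, 1.0)
print(coeffs_two_pass)
```

Because only the coefficients change with the pass count, the same cubic solver can serve every arrangement; the repeated roots $p_4$, $p_5$ need no numerical work.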
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If the number of tube passes approaches infinity, this method 
becomes very complicated, but on the other hand the temperature 
gradient in one tube pass becomes almost zero, therefore we can 
consider this case as a crossflow-type heat exchanger with both 
fluids mixed normal to the directions of flow, respectively. The
static characteristics of this case have already been obtained
[15]. Although its dynamic response is relatively easy to obtain,
the phase characteristics cannot be obtained because the dead times
are not necessarily equal. Consequently, only the static response
formulas can be used, to obtain the limiting value of the d-c gain
in this case.


Numerical Example of Frequency Response 


Static and dynamic responses are compared using the transfer function formulas obtained above, neglecting the heat capacities of the solid parts. The system parameters used are: r = 1, total a1 = total a = 2. In other words, equal heat-exchange conditions are assumed for all heat exchangers.

Fig. 10 shows that the d-c gain patterns are related to one another quite systematically. Note that the response of a heat exchanger with more than two tube passes lies between C and P. Also note that, as the number of tube passes increases, the effects of counterflow and parallel flow in the individual passes average out and the response approaches the value for an infinite number of tube passes.
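
The ordering of the d-c gains can be checked with a few lines of Python. This is a minimal sketch under two assumptions: that "total a = total a1 = 2" corresponds to per-pass values a = a1 = 1 in the 1-2 pass static formula $G_{PC-s}(0) = 2a_1/(a_1 + a + \sqrt{J}\coth\sqrt{J})$ with $J = a^2 + a_1^2$, and that the pure counterflow and pure parallel-flow limits follow the standard balanced-flow effectiveness expressions, which are textbook results rather than formulas taken from this paper. The variable names are illustrative.

```python
import math


def to_db(gain):
    """Convert a d-c gain to decibels."""
    return 20.0 * math.log10(gain)


total = 2.0                                   # total a = total a1 = 2, r = 1

# Textbook balanced-flow limits (assumed, not from the paper)
gain_c = total / (1.0 + total)                # pure counterflow
gain_p = (1.0 - math.exp(-2.0 * total)) / 2.0 # pure parallel flow

# 1-2 pass exchanger, per-pass coefficients (assumed to be total / 2)
a = a1 = total / 2.0
root_j = math.sqrt(a * a + a1 * a1)
gain_pc = 2.0 * a1 / (a1 + a + root_j / math.tanh(root_j))

for name, gain in (("C", gain_c), ("P-C", gain_pc), ("P", gain_p)):
    print(f"{name:4s} gain = {gain:.4f}  ({to_db(gain):.2f} dB)")
```

Under these assumptions the 1-2 pass value falls between the counterflow and parallel-flow limits, and its attenuation comes out close to the 5.09 label attached to P-C, C-P in Fig. 10.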

Fig. 11 shows the frequency response of the 1-2 pass heat exchanger.

Note that the phase curves of Gpc-s(jω) and Gcp-t(jω) have a peak value and the gain curves are oscillatory over a certain frequency range. P and C denote the responses of the pure parallel-flow and counterflow heat exchangers, respectively.

Fig. 12 shows the responses of the 1-3 and 1-4 pass heat exchangers for shell-side input. Note that the response curves may be oscillatory for ω greater than about 1.5.

From Figs. 11 and 12, the following results may be observed and summarized:


1 The phase lag increases with the number of tube passes.

2 In the very low frequency range, the gain approaches the value given by the d-c gain and the phase lag is small, which means the whole length of the shell pass and tube pass is effective.

3 In the medium frequency range, the input and output relations have a marked effect on the frequency response characteristics.

4 In the high frequency range, both the phase and the gain curves become oscillatory for ω > 1.5, which might be due to the interaction between shell pass and tube pass.

5 The phase lag would be larger and the gain would be more attenuated when solid capacities are added in a practical heat exchanger, especially in the high frequency range.


[Fig. 10 curve labels: 3.47, C (pure counterflow); 4.88, C-P-C; 5.09, P-C, C-P; 5.15, P-C-P-C, C-P-C-P; 5.53, P-C-P; 6.19, P (pure parallel flow); 7.96, M (both fluids mixed case)]
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Fig. 10 Comparison of static characteristics of multipass heat exchangers 
forr = 1, total a, = totala = 2 
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Fig. 11 Frequency response of 1-2 pass heat exchangers for r = 1, 


Fig. 12 Frequency response of 1-3, 1-4 pass heat exchangers for shell- 
side input, r = 1, total a, = totale = 2 


Analog Computer Test 


Although several analog circuits for pure parallel-flow and counterflow heat exchangers have already been published [16, 17], the author devised a simpler method using an ordinary analog computer to investigate the dynamic response.

Considering case P-C and dividing the heat exchanger with no wall capacity, as shown in Fig. 13(a), into four equal lumped systems gives


Fig. 13(c)-1 


d6; 


+ (a, + = 6; + 
dr 


d6, 


+ (a, + 1)0, = 6, + a0, 
dr 


6 
+ (a, + 1)6; = + 
dr 


Fig. 13(c)-2 
28 &@ 

q, 
q 13) 

6. 

Fig. 13 Analog-computer test, (a) lumped-parameter model of 1-2 pass heat exchanger; (b) analog computer circuit; (c)-1 indicial response; and (c)-2 frequency response for shell input
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+ (a, + 1)05 = + 


— + (a + = + 


+ (a; 1)6, + a0. 


(44) 


= + (a, + = 0, + ( Cont.) 


+ (2a + 1)0, = 0; + + 
dO; 
+ (2a + = + + 


0 
+ (2a + 1)0; = O, + + 


= 0; + a(@ + 05) 


+ (2a + 1)O, 
dr 


The analog computer circuit using the same system parameters as in the former numerical example is shown in Fig. 13(b).

Fig. 13(c) shows the frequency response and the transient response for shell-side input. In some frequency ranges the phase curve clearly shows a peak value, which would approach that of the exact distributed system as the number of lumps increases.
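
The effect of the number of lumps can be illustrated on the static gain alone. The Python sketch below is not the author's analog circuit: it assumes a simple well-mixed-stage model of case P-C (each of `n` lumps holds one shell cell and one cell of each tube pass, with per-pass coefficients a = a1 = 1) and compares its steady-state gain for shell-side input with the distributed result $2a_1/(a_1 + a + \sqrt{J}\coth\sqrt{J})$, $J = a^2 + a_1^2$. The function names are hypothetical.

```python
import math


def static_gain_exact(a=1.0, a1=1.0):
    """Distributed-parameter static gain of the 1-2 pass exchanger, shell input."""
    root_j = math.sqrt(a * a + a1 * a1)
    return 2.0 * a1 / (a1 + a + root_j / math.tanh(root_j))


def static_gain_lumped(n, a=1.0, a1=1.0):
    """Steady-state tube outlet for unit shell inlet, n well-mixed lumps (assumed model)."""

    def mismatch(guess):
        # March from l = 0 to l = 1 with a guessed tube-outlet temperature.
        th1, th, th2 = 0.0, 1.0, guess      # tube inlet, shell inlet, tube outlet
        det = (n + a1) * (n + 2.0 * a) - a * a1
        for _ in range(n):
            # Steady balances of one lump for first-pass tube (x) and shell (y):
            #   (n + a1) x - a1 y      = n th1
            #   -a x + (n + 2a) y      = n th + a th2
            rhs1 = n * th1
            rhs2 = n * th + a * th2
            x = (rhs1 * (n + 2.0 * a) + a1 * rhs2) / det
            y = ((n + a1) * rhs2 + a * rhs1) / det
            # Second pass flows right to left, so its upstream value is one lump to the right.
            th2 = th2 - (a1 / n) * (y - th2)
            th1, th = x, y
        return th2 - th1                    # passes must join at the turn-around

    # The mismatch is affine in the guess, so two evaluations determine the outlet.
    m0 = mismatch(0.0)
    m1 = mismatch(1.0)
    return m0 / (m0 - m1)


exact = static_gain_exact()
for lumps in (4, 16, 64, 400):
    print(lumps, static_gain_lumped(lumps), exact)
```

With this simple stage model the static gain approaches the distributed value only slowly as the number of lumps grows, which is in line with the remark above that the lumped response approaches the exact one as the number of lumps increases.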


Experimental Results 


The frequency response of the multipass heat exchanger is quite different from that of the pure parallel-flow and counterflow exchangers, as has been demonstrated for the 1-2 pass case by the numerical example of Figs. 11 and 12.

The author has made experiments on a two-pass heat exchanger as shown in Fig. 14. The length of the heat exchanger was chosen as 10 meters so as to cover the theoretical peak of the phase shift curve. The heat-exchange process is water-to-water. The shell material was "polynite" in order to decrease the effects of side capacity; the tube pass was thin (0.3 mm) copper tube. The test conditions are:


HEAD TANKS 
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5000 mm ROCK WOOL 
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Fig. 14 Schematic diagram of experimental apparatus, frequency- 
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Fig. 15 Experimental results, (a) frequency responses for shell-side 
input, (b) frequency responses for tube-side inpur 
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Fig. 16 Indicial responses 
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For 
tube-side input 
2.86 
0.43 


6.12 
0.81 
0.04 


For 
shell-side input 
3.01 
0.45 


5.94 
0.92 
0.046 


70.8 
68.7 
0.25 


62.5 
70.8 
0.22 


0.368 sq meter 

8.815 10~* keal/meter deg C 
122.7 K 10~* kcal/meter deg C 
10 meter 

H/v, = 1 min 

v = 10 meter/min, r = 1 
0.102 keal/meter deg C 

0.682 kcal/meter deg C 


Temperature was recorded by a Yamatake-Honeywell Electronik potentiometer with bare Cu-constantan wire (0.2 mm diameter) as the temperature sensing elements, the response speed of which is sufficiently high for the dynamic response measurements.

The experimental results which involve the effects of series and 
side capacities are shown in Figs. 15, 16. 

Fig. 15(a) shows the frequency responses for shell-side input, and Fig. 15(b) those for tube-side input. Favorable agreement is seen between the theoretical and the experimental results. The four cases in Fig. 15 correspond to the four cases in Fig. 16, which shows the transient responses.

Note that the phase shift characteristic of case a is quite similar to that of case d, but the gain characteristic is different; this may be due to the difference between the heat capacities of the input-side fluid and output-side fluid.

Although the same remarks apply to cases c and b, the effects of the large dead time are pronounced.


Control of 1-2 Pass Heat Exchanger 


As can be seen from the experimental results, the choice of input and output is especially important for control applications. This can be seen clearly in the closed-loop response, as follows:

Fig. 17 shows the responses for shell-side input when P and PI control actions were used.


Fig. 17 Closed-loop transient responses to step-function disturbances (set-point change; curves labeled by controller setting, e.g. P 10%, P 40%),
upper diagram shows the responses for a in Fig. 15(a)
lower diagram shows the responses for b in Fig. 15(c)
In Fig. 17, the upper diagram, resulting from case a in Fig. 15, shows a very stable response; the lower one, from case b in Fig. 15, shows a far less stable response.
A technique for obtaining the roots of the characteristic equation is described here. In general, the third-order characteristic equations with $s = j\omega$ may be written in the form

$$p^3 + p^2(m_1 + jn_1) + p(m_2 + jn_2) + m_3 + jn_3 = 0 \qquad (45)$$

where the m's and n's are real numbers and are functions of the variable ω.

Substituting $p = u + jv$ into the equation and equating both the real and the imaginary parts to zero yields the following two equations:

$$
\begin{aligned}
u^3 + u^2 m_1 + u(m_2 - 2vn_1 - 3v^2) - v^2 m_1 - v n_2 + m_3 &= 0 \\
v^3 + v^2 n_1 + v(-m_2 - 2um_1 - 3u^2) - u^2 n_1 - u n_2 - n_3 &= 0
\end{aligned}
\qquad (46)
$$

Fig. 18 Auxiliary curves to obtain the roots of Equation (47)

p? + p(2.89 + 74.16) 


| 10+ 


Fig. 19 Three roots of the characteristic equation on the complex plane 


Considering the region u > 0, v > 0 only, and expanding the scales, one obtains $p_1 = 0.89 + j2.16$. Next, divide Equation (47) with ω = 2 by $p - 0.89 - j2.16$:


Quotient (a): p² + p(2.89 + j4.16) − 3.4135 + j5.9448

p − 0.89 − j2.16 ) p³ + p²(2 + j2) + p(3 − j4) + 16 + j2
p³ − p²(0.89 + j2.16)
p²(2.89 + j4.16) + p(3 − j4)
p²(2.89 + j4.16) + p(6.4135 − j9.9448)

Three pairs of curves can be drawn on the u-v complex plane from these two equations, and the three roots $p_1$, $p_2$, and $p_3$ can be found from the intersecting points of the curves.

As an example, consider Equation (4) with r = 1 and a1 = a = 1. By letting $s = j\omega$, the equation becomes

$$p^3 + p^2(2 + j\omega) + p(\omega^2 - 1 - j2\omega) + 4\omega^2 + j\omega(\omega^2 - 3) = 0 \qquad (47)$$

In order to find the values of the p's for ω = 2, first, the pair of equations corresponding to Equations (46) is found:

$$u^3 + 2u^2 + u(-3v^2 - 4v + 3) - 2v^2 + 4v + 16 = 0 \qquad (48)$$

$$v^3 + 2v^2 + v(-3u^2 - 4u - 3) - 2u^2 + 4u - 2 = 0 \qquad (49)$$

By letting u = 3, 2, 1, 0, and so forth in Equation (48), two values of v that satisfy the equation are evaluated for each value of u. The result is the u-curves shown in solid lines in Fig. 18. The v-curves are obtained in like manner, and are shown in dotted lines in the same figure. The three intersecting points are the roots of Equation (47).

Although the accuracy of the p's may be improved by expanding the u, v-scale about the points of intersection, the following technique used by the author is recommended for its simplicity.
p(−3.4135 + j5.9448) + 16 + j2
p(−3.4135 + j5.9448) + 15.878783 + j2.082288
Remainder (b): 0.121217 − j0.082288


In this division, 0.89 and 2.16 are modified individually so as to make the residual (b) as small as possible. The final results are 0.8950 and 2.1630. Then from the quotient (a), obtained through this procedure, the rest of the p-values can be easily obtained. The values are:

$p_2 = -2.5320 - j2.0902$
$p_3 = -0.3629 - j2.0728$

Although this division may appear to be rather troublesome, it is more efficient than obtaining each intersecting point separately.

Applying the same procedure for each ω, other p-values can be obtained as shown in Fig. 19.
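
The division step lends itself to a short numerical sketch. The Python code below divides the cubic of Equation (47) at ω = 2, taken here as $p^3 + p^2(2 + j2) + p(3 - j4) + 16 + j2$, by $p - p_1$ using synthetic (Horner) division, then reduces the residual. Where the text adjusts 0.89 and 2.16 by hand, this sketch substitutes Newton's iteration, which is a swapped-in technique and not the author's procedure; the helper names `divide` and `refine` are hypothetical.

```python
import cmath


def divide(coeffs, p):
    """Divide a polynomial (highest power first) by (x - p).

    Returns (quotient coefficients, remainder); the remainder equals the
    polynomial evaluated at p.
    """
    acc = [coeffs[0]]
    for c in coeffs[1:]:
        acc.append(c + acc[-1] * p)
    return acc[:-1], acc[-1]


def refine(coeffs, p, steps=20):
    """Newton iteration on p; the derivative at p is the quotient evaluated at p."""
    for _ in range(steps):
        quotient, remainder = divide(coeffs, p)
        _, slope = divide(quotient, p)
        p = p - remainder / slope
    return p


cubic = [1, 2 + 2j, 3 - 4j, 16 + 2j]          # Equation (47) with omega = 2

# First division with the graphical estimate of the root
quotient_a, residual_b = divide(cubic, 0.89 + 2.16j)

# Reduce the residual, then take the remaining roots from the quadratic quotient
p1 = refine(cubic, 0.89 + 2.16j)
(_, b, c), _ = divide(cubic, p1)
disc = cmath.sqrt(b * b - 4 * c)
p2 = (-b - disc) / 2
p3 = (-b + disc) / 2

print(quotient_a, residual_b)
print(p1, p2, p3)
```

The first division reproduces the quotient (a) and residual (b) of the worked example, and the refined roots agree with the values quoted in the text to about two significant figures.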


DISCUSSION 
P. D. Hansen? and A. Fux’ 


The writers congratulate the author for this fine and timely paper on heat-exchanger dynamics. The organization and solution of many complicated equations is particularly noteworthy. There are three areas touched upon where the writers feel that a contribution from their work may add to this substantial contribution.

1 The author indicates two vastly different philosophies used in attacking dynamic system problems. Most attention in this paper is directed toward achieving the frequency response (gain and phase curves) by formally solving a classical set of partial differential equations. This method is useful as a tool for analysis. However, the effects of parameter or configuration changes are not apparent, hence the analytical solutions have limited value in design. The author's frequency response method results in a precise solution of a mathematical problem. However, in order to formulate the mathematical problem so that it can be solved, a number of simplifying assumptions must be employed, such as spatial uniformity of physical parameters. The partial differential equations must be linear in the independent variables and have constant coefficients in order to employ this method. Furthermore, a heat exchanger can rarely be considered uncoupled from its environment, i.e., inputs independent of the heat-exchanger behavior. The frequency response methods prove quite awkward for any interacting systems that cannot be treated as a simple cascade.

By time domain modeling, the second philosophy employed by the author, it is possible to avoid these difficulties. Spatial parameter variations, flow changes, and more complicated configurations can be treated in a straightforward manner. However, in order to achieve a solution, space (and perhaps time) usually must be treated discontinuously, therefore introducing the possibility of truncation error. The writers feel that the principal application of exact frequency response solutions is to provide a validation for approximate time (or frequency) domain models.

Spatial lumping may be visualized in terms of separate energy transport and energy transfer operations (see models 1-4 of Table 1). The energy (or heat) transfer is considered as an interaction between two flow streams. If more streams are involved, such as in the author's computer model, the additional interactions may be characterized by additional transfer nodes. The statics are entirely determined by the energy transfer characteristics, energy transport (or storage) affecting only the dynamics. For this reason the coefficients of each transfer element may be selected so that the local static behavior is matched exactly. The exact static solutions are most simply stated in terms of the causal model (No. 1).

A, = (1 + Γ)G
B = (1 − Γ)G
=
a +o
a+<6e
B= —
7
G = G{configuration, β, |Γ|}

Γ is a function of the mass flow rate-specific heat products. When these products are equal (balanced statics), Γ = 0. When there is no spatial temperature change in one fluid (condenser), |Γ| = 1.

2 Senior Engineer, Microtech Research Company, Cambridge, Mass.; formerly Instructor, Mechanical Department, Massachusetts Institute of Technology, Cambridge, Mass. Assoc. Mem. ASME.

Table 1 Comparison of spatially lumped heat-exchanger models
Note: The F's are unity gain monotone dynamic operators selected to facilitate analysis or computation.
Fig. 20 Static responses of several two-fluid heat exchangers
1 Counter-flow, Γ = 0; 2 cross flow neither fluid mixed, Γ = 0; 3 cross flow one fluid mixed, Γ = 0; 4 cross flow both fluids mixed, Γ = 0, one shell pass, infinity tube passes, Γ = 0; 5 one-shell pass, two tube passes, Γ = 0; 6 parallel flow, −1 < Γ < 1, any configuration, |Γ| = 1.


The static gain term, G, is given for several configurations men- 
tioned in this paper: 


a Counter-flow 


~ 14 coth (sT) 


b Parallel flow 


c One shell pass, two tube passes 


1 
G = 
1+ (8 VW1 + 


d One shell pass, infinity tube passes, and cross flow, both 
fluids well mixed normal to the flow 


1 
G= 


greater disagreement with the condenser solution than the 
balanced statics solutions. For complicated geometries, the 
local static behavior may not be known in advance. The proce- 
dure in this case is to select an element size small enough so that 
its static behavior is essentially independent of the local flow 
directions. Less than 5 per cent error in the local predicted be- 
havior results when G is chosen as: 


G 


(the exact solution for balanced counter flow) 
provided that the value of 6 for a single lump does not exceed 2/3, 
the local value of 8 is proportional to the heat-transfer area per 
lump and hence inversely proportional to the number of lumps. 
In order to also avoid excessive error during transient or non- 
steady-state operation, it may be necessary to choose an even 
larger number of lumps. The number of lumps, thus, depends 
on the frequency band for which the model must behave quanti- 
tatively as does the real exchanger and on the size of the exchanger 
measured in units. 

All of the models of Table 1 converge to the same answer as 
the number of lumps becomes infinite. In the interior of the first 
four system models there is a transport operator interposed be- 
tween every two transfer points on the same flow path. The 
static and dynamic operators become identical in the four cases as 
the value of 8 for an individual lump approaches zero. Thus 
differences among these four system models can occur in only the 
termination lumps, these producing negligible effect when the 
number of lumps is large. Faster convergence can be expected 
with the first two than with three and four because of the non- 
symmetrical way of handling the shell to tube and tube to shell re- 
sponses in the latter models. 

The convergence of the author’s computer model is much 
slower than the first four because the author’s parameters are not 
compensated for finite lump size. This was proved by examining 
appropriate expressions obtained for the transfer functions of 
these different models for a finite number of lumps. 

The writers favor the causal model (number 1) for analog, 
digital, or hand calculations. This model avoids unnecessary, 
local feedback loops making programming and computing simpler 
and preventing computer instability due to algebraic loops. In 
active analog computation a continuous time scale is employed. 
For this purpose the monotone dynamic operators, F, and F,, are 
chosen as simple lags having time constants 7, and 7, respec- 
tively. 

For digital and hand calculations a discontinuous time scale 
must be employed. For this purpose F, and F, are conveniently 
chosen as pure delays having delay times of 7, and 7, with the 
restriction that the ratio 7,/7, must be rational. For compli-
cated configurations analog and hand solutions become imprac-
tical because of the large number of lumps required to character- 


These solutions and others are plotted versus B/(1 + 8) for 
balanced statics, [ = 0, in Fig. 20. It is noted that the solutions 
for all cross flow configurations and one shell pass, an even num- 
ber of tube passes, converge to the same trace at small 8, and this 
trace converges with the traces for parallel and counter-flow for 
still smaller 8. It is noted that the author’s well-mixed stage com- 
puter model does not converge very rapidly to the correct answer 
as 8 is decreased 

As the absolute value of I approaches unity, the statics for all 
configurations approach the condenser solution which has the 
same G@ as balanced parallel flow. The well-mixed stage ap- 
proximation moves in the opposite direction and is in even 
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ize the geometry. Either simplified approximate methods or 
digital computer solutions must be used. 

The time constants, rT, and 7,, can be corrected to include the 
effect of wall capacitance as well as the fluid transport times pro- 
vided the wall charging time, L,/(b; + be), is small. 
Fig. 21 Analog computer model. One shell pass, two tube passes, P-C (cases a and c). Balanced statics, Γ = 0.


If L,/ (bi + bz) is not small, the energy transfer becomes a dynamic 
operator, and the computer models of Table 1, column 4, no longer 
apply. This is generally the case with gas to gas exchangers. 
If L,/b, is not small, a two-parameter model for F,, must be em- 
ployed. 

2 The author has found experimentally and analytically (Figs. 15 and 16) that the tube-side dynamic response to a shell-side input is quite different from the shell-side response to a tube-side input even though the statics are identical. His choice of flow rates tends to emphasize this difference. The static performance of his experimental exchanger is almost that of a condenser, Γ = 0.74 (which behaves the same for all configurations), since there is nearly no axial temperature gradient in the shell fluid. This situation is rarely encountered in practice since it utilizes very inefficiently the heat-transfer area and the capacity of the shell fluid to absorb thermal energy. Furthermore, the value of β in the experiment, β = 3.5, is larger than a practical design would allow since much of the heat-transfer surface is wasted. A far more practical situation results when Γ is equal to zero. In this case the steady-state temperature rise of one fluid is equal to the steady-state temperature fall of the other, hence the name, "balanced statics."

The writers have studied a 10-lump analog computer model of 
the causal form for the one-shell pass, two-tube pass configuration 
using balanced statics (Fig. 21). It was found that the difference 
between the two cases, a and c, was almost insignificant when the 
statics are balanced, Fig. 22. Furthermore, the temperature re- 
sponse to a temperature input rarely occurs within a tight control 
loop because it is difficult to manipulate an input temperature. 

Far more frequently a flow rate (or valve position) is manipu- 
lated to achieve tight control of an output temperature. The 
outlet temperature response to a flow rate change does not exhibit 
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pure delay since the flow changes almost instantaneously through- 
out the exchanger. Furthermore, the transport time is fre- 
quently as much as ten times smaller than that of the author’s 
experiment. In this circumstance the temperature measuring in- 
strument may be slower than the exchanger particularly if the 
temperature probe is mounted in a well. In this case a rather 
primitive dynamic model matching only the exchanger low fre- 
quency behavior may be sufficient. 

3 The solution for the case of cross flow, both fluids well-mixed normal to the flow, is identical to the limiting case of one shell pass, infinity tube passes, dynamically as well as statically, provided that the over-all transport times as well as the static parameters, β and Γ, are matched. This can be seen from Fig. 23, where the temperature change and transport time per tube pass are infinitesimal. The frequency response solution for this case is far simpler than those of the author since a separate cubic equation does not have to be factored for each point on the frequency response plot.

The starting differential equations for the cross flow case are: 


1 
-+m,,0, = ma 6,dy 
0 


1 
+ m0, = maf 6,dz 
oy 0 


where 


0, = 6, = 4,(y) 


These may be solved by standard methods to give the tempera- 
ture at any point in the exchanger as a function of the dynamic 
operators, Mag, Mary Moy, Mray 


and 


Bain 


4 


Fig. 22 Analog responses to unit step temperature inputs 
10 seconds (for each of ten lumps G = 0.166,7 = 1 sec). One shell pass, 


two tube passes, P-C (cases a and c). 
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For the special case of radially lumped wall and shell capaci- 
tances [see equations (13) of the paper] the m’s are given by: 
na;'(1,D + bi) 
= 
a,'I,D 
+ b, 


na'(I,D + be) 
= rL,D 
LD + be 
na,'b; 
+b, + bs 


a'be 

= 

+ by + be 
where the nondimensional time domain operator, s = 0/dr 
of the author, is replaced by the real time domain operator, 
D = 9/dt, where r = t/L,. Otherwise the symbols are the same 
as the author's. The limiting process of making the number of 
tube passes, n, approach infinity is accomplished while keeping 
the over-all parameters nL, r/n, na;’, na’, b;/Ly, b,/In, and b,/Iy 
Fig. 23(a) Cross flow. Both fluids well mixed normal to the flow. 

Fig. 23(b) One shell pass, infinity tube passes. 


at their appropriate finite values. For any finite number of tube 
passes this solution is not exact, but it is approximately valid for 
any even number or large odd number of tube passes, provided 
the statics are nearly balanced. It is the writers’ opinion that 
significant discrepancies result only when § is greater than 3 
(which would be poor static design, see Fig. 20) or when very high 
frequency behavior is important. When the statics are unbal- 
anced the configuration plays a far stronger role in determining 
system dynamics as was pointed out earlier. Therefore for un- 
balanced statics the cross flow solution can apply only when a 
large enough number of tube passes is employed so that the shell- 
side output for a tube-side input is nearly identical to the tube- 
side output for a shell-side input. 


Author's Closure 


The author would like to thank Mr. Hansen and Mr. Fux for 
their interest and discussion: 


1 The author appreciates the discussers’ constructive re- 
marks on heat-exchanger dynamics. 

The author’s first objective was to seek an exact mathematical 
solution to his adopted model of heat exchangers. The value of 
an exact solution is that it enables one to understand the physical 
nature of heat-exchanger dynamics and shows the accuracy of 
approximate methods. These are very important for 
improving dynamic response. 

Although the mathematical forms are complicated and one cannot 
see the effects of parameter variations directly, as Mr. Hansen 
and Mr. Fux pointed out, the dynamic response at a particular 
frequency may be obtained without much difficulty using three p 
values, because these auxiliary roots of parameter variation are 
clearly shown on the complex plane. 

2 The author's aim in the experimental measurements was 
the verification of the theoretical analysis, and the heat 
exchanger was chosen to investigate the oscillatory aspects of the 
frequency response. This objective was met, but it must be admitted 
that the static performance of the author’s exchanger is that of 
a condenser; the discussers’ comments in this respect are interesting 
and noteworthy. 

Their remark that there is almost no significant difference between 
cases a and c when the statics are balanced may be due to 
their having selected a very high fluid velocity. When the 
fluid velocity is low, there always appears a difference of transport 
lag, as the author showed by the experimental data. The 
author’s results show the most evident difference between a and 
c because shell and tube fluid have the same velocity, and statics 
are unbalanced. 

3 The discussers’ analog approximation is very interesting 
and of great practical value, especially when another system lag is 
large, but in general it may be difficult to make an analog circuit 
with sufficient accuracy because the exchanger is a distributed-parameter 
system. Care must be taken when selecting the frequency range to be 
used. 

4 The discussers’ idea of infinite tube passes is very interesting. 
The limiting process of making the number of tube passes approach 
infinity while keeping the transport time from tube input 
to output at a finite value would show a somewhat different result from 
the author’s. 

5 The author tried to find the physical aspects of the cyclic 
pattern of frequency response which multipass heat exchangers 
have, as has been shown in the theoretical curves of Fig. 11 and of 
Fig. 15. 

Fig. 24 A, B, C, and D in the lower diagrams each corresponds to the 
same signed tube fluid in the upper diagrams. The value K is determined 
by the total heat transfer from the shell fluid to the whole tube 
passes. 

Fig. 25 Frequency response (gain curve and phase curve for input shell fluid temp. variation) 
At ω_1, heat transferred and transported are subtracted 
At ω_2, heat transferred and transported are superposed 

This may be due to the interaction between shell fluid and tube 
fluid. The heat-exchanger dynamic response is governed by the 
fluid transport lag and heat-transfer lag as shown in the schematic 
diagrams of Fig. 24. 

When the transferred heat and the transported heat superpose 
or subtract at the output of the exchanger at some particular fre- 
quency input, the interaction appears. This is shown in Fig. 25. 
This effect of interaction when a cyclic signal is considered is 
usually much diminished by the heat capacity of tube wall. 


MARCH 1960 / 65 




Bang-Bang Versus Linear Control of a 
Second-Order Rate-Type Servomotor 


A simple nonlinear scheme for controlling a second-order rate-type servomotor is 
described. This scheme is referred to as “bang-bang” control since the input to the 
servomotor is made to “bang” from its maximum value in one direction to its maximum 
value in the other direction depending only on the sign of an error signal. This 
bang-bang system oscillates in a continuous high-frequency, low-amplitude limit 
cycle. The nature of this limit cycle is studied by the describing function approxima- 
tion and by an exact method. The step and frequency response characteristics of the 
bang-bang system are discussed and compared with the characteristics of a simple 
linear system. It is shown that many aspects of the behavior of the bang-bang system 
can be predicted from rather simple considerations. 


Introduction 


A rate-type servomotor has been defined by 
Shearer [1]¹ as a servomotor with a rate of change of output pro- 
portional to the input during steady-state operation. This paper 
deals with a simple nonlinear scheme for bringing about closed 
loop control of a second-order rate-type servomotor. This scheme 
is shown in Fig. 1 and is referred to as bang-bang control since 
the servomotor input X is made to bang from its maximum 
value in one direction to its maximum value in the other direction 
depending only on the sign of the error signal E. This error sig- 
nal is simply a linear combination of the reference input signal 
and proportional plus rate feedback signals. 

Possible reasons for considering any bang-bang type control 
scheme might include (a) the possibility of faster dynamic re- 
sponse since full effort is made available to overcome even very 
small errors, (b) possible simplification of the construction of 
various components in the control loop, (c) possible elimination 
or reduction of problems caused by hysteresis, zero shift, and 
other slight nonlinearities present in all real “linear” systems. 

The idealized bang-bang system shown in Fig. 1 may be repre- 
sented by the following equations: 


1 Numbers in brackets designate References at end of paper. 

Contributed by the Instruments and Regulators Division and pre- 
sented at the Instruments and Regulators Conference, Cleveland, 
Ohio, March 29-April 2, 1959, of The American Society of 
Mechanical Engineers. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, March 2, 
1959. Paper No. 59—IRD-13. 


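$$\frac{Y}{X} = \frac{k_s}{(D^2 + 2\zeta\omega_n D + \omega_n^2)D} \tag{1}$$

$$k_p Y^* - k_p Y - k_r\dot{Y} = E \tag{2}$$

$$X = +X_{\max} \ \text{for } E > 0, \qquad X = -X_{\max} \ \text{for } E < 0 \tag{3}$$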


These equations might, for example, represent an idealization 
of a pneumatic or hydraulic servomechanism. A thesis [2] by 
the author deals with the control of a pneumatic servomechanism 
in somewhat more detail than is presented in this paper. 


Limit-Cycle Behavior 


Consider the idealized bang-bang system described by 
Equations (1), (2), and (3) and assume that Y* is zero. The 
value of Y should then also be zero. If Y is zero (and remains 
zero) X must also be zero. However, for this idealized system X 
cannot be zero, and thus the value of Y cannot be zero at all 
times. The system operates in such a way, however, that the 
average value of both X and Y is zero. 

From symmetry considerations, then, X must be a square 
wave of amplitude equal to X_max, and Y must be the response of 
the servomotor to this square wave. The steady oscillation of Y 
is known as a “limit cycle” and will be analyzed first by an ap- 
proximate method and then by an “exact” method. 

Nomenclature 

C_1 = constant depending on initial conditions 
C_2 = constant depending on initial conditions 
C_3 = constant depending on initial conditions 
D = derivative with respect to time = d/dt 
e = Naperian log base 
E = error signal 
H = (k_r/k_p)ω_n = a dimensionless measure of the rate feedback gain 
k_c = controller gain 
k_p = proportional feedback gain 
X = servomotor input variable 

Approximate Method—Describing Function Technique. The describing 
function is an approximate method for determining the frequency 
and amplitude of oscillation of a limit cycle. This technique 
is described in detail in the literature [3, 4, 5] and is discussed 
here only briefly and with respect to the idealized system in 
Fig. 1. 

The assumption made in the use of the describing function 
technique is that the square-wave oscillation of X may be re- 
placed by a sinusoid equal to the first component of a Fourier 
series expansion of X. 

Consider that the closed-loop system in Fig. 1 is broken by removing 
the nonlinear element. If X is now made to oscillate 
sinusoidally, E will also be sinusoidal. Since the nonlinear element 
is assumed to have no phase shift (i.e., no hysteresis), the condition 
for steady oscillation of the closed-loop system is simply 
that when the loop is broken E will be exactly in phase with X. 
The amplitude of E is unimportant, however, since X depends only 
on the sign of E and not on its magnitude. It is now convenient 
to define 


T = the time required for the limit cycle oscillation to complete 
a half cycle 

and 

$$H = \frac{k_r}{k_p}\,\omega_n \tag{4}$$

where 

H = a dimensionless measure of the rate feedback gain 

Using these definitions and a relationship between E and X obtained 
from Equations (1) and (2) 

$$\left(\frac{T\omega_n}{\pi}\right)^2 = 1 - 2\zeta H \tag{5}$$


This equation relates the limit-cycle frequency (actually the time 
for half a cycle) to the system parameters ζ and H. 

Consider a system with fixed ζ. At H = 0, the value of Tω_n 
is π. Therefore, in the absence of rate feedback, the limit cycle 
oscillation occurs at the natural frequency of the system. As H is 
increased from zero, Tω_n decreases, and at some value of H, Tω_n 
goes to zero. At this point the frequency of the limit cycle is infinite 
and thus the amplitude of oscillation is zero. According to 
Equation (5), Tω_n becomes zero when 

$$2\zeta H = 1 \tag{6}$$

Thus, when ζ and H satisfy Equation (6), the limit cycle oscillation 
occurs at infinite frequency and zero amplitude. 
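
The describing-function result can be evaluated numerically. The following is a minimal sketch, assuming Equation (5) has the form (Tω_n/π)² = 1 − 2ζH, which is the form consistent with the limiting cases stated in the text (Tω_n = π at H = 0 and Tω_n = 0 at 2ζH = 1). The function names are illustrative only and do not come from the paper.

```python
import math


def half_period_describing_function(zeta, H, omega_n=1.0):
    """Half-period T of the limit cycle from the assumed form of Eq. (5):
    (T * omega_n / pi)**2 = 1 - 2 * zeta * H.
    Returns 0.0 when 2*zeta*H >= 1 (infinite-frequency limit cycle)."""
    margin = 1.0 - 2.0 * zeta * H
    if margin <= 0.0:
        return 0.0
    return math.pi * math.sqrt(margin) / omega_n


def limit_cycle_frequency(zeta, H, omega_n=1.0):
    """Limit-cycle frequency in rad/sec; one full cycle lasts 2*T."""
    T = half_period_describing_function(zeta, H, omega_n)
    return math.inf if T == 0.0 else math.pi / T


if __name__ == "__main__":
    # No rate feedback: oscillation at the natural frequency.
    print(limit_cycle_frequency(zeta=0.5, H=0.0, omega_n=1.0))
    # Rate feedback satisfying 2*zeta*H = 1: infinite frequency.
    print(limit_cycle_frequency(zeta=0.5, H=1.0, omega_n=1.0))
```

With H = 0 the sketch returns the natural frequency, and with 2ζH = 1 it returns an infinite frequency, matching the two limiting statements above.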

Exact Solution. The “approximate” analysis is based on the as- 
sumption that the limit cycle oscillation of Y can be approxi- 
mated by a pure sinusoid. The analysis to be described, how- 
ever, represents an exact solution within the assumptions implied 
by Equations (1), (2), and (3). 
Fig. 2 Limit-cycle oscillation 


In the limit cycle, X is a square wave of amplitude X_max. The 
output Y is the response of the servomotor to this square wave. 
Thus Y is not really sinusoidal and in this analysis is not assumed 
to be so. 

Fig. 2 shows the approximate wave forms for X, Y, Ẏ, and Ÿ in 
the limit cycle. Consider the time interval between t = 0 and t = 
T. During this interval X remains constant at the value +X_max. 
Now since Ẏ follows X by a quadratic lag, Equation (1) may be 
written for 0 < t < T as 

$$(D^2 + 2\zeta\omega_n D + \omega_n^2)\,\dot{Y} = k_s X_{\max} = \text{const} \tag{7}$$


The solution of this equation is 

$$\dot{Y} = e^{-\zeta\omega_n t}\left[C_1\sin\sqrt{1-\zeta^2}\,\omega_n t + C_2\cos\sqrt{1-\zeta^2}\,\omega_n t\right] + \frac{k_s X_{\max}}{\omega_n^2} \tag{8}$$

where 

C_1, C_2 = constants determined by the initial conditions at 
t = 0 


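The output position Y in this interval is simply the integral of 
Equation (8). Thus 

$$Y = \frac{e^{-\zeta\omega_n t}}{\omega_n}\left\{\left[C_2\sqrt{1-\zeta^2} - C_1\zeta\right]\sin\sqrt{1-\zeta^2}\,\omega_n t + \left[-C_1\sqrt{1-\zeta^2} - C_2\zeta\right]\cos\sqrt{1-\zeta^2}\,\omega_n t\right\} + \frac{k_s X_{\max}}{\omega_n^2}\,t + C_3 \tag{9}$$

where 
C_3 = constant depending on the initial conditions 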


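Furthermore, Ÿ in this interval is simply the derivative of 
Equation (8) 

$$\ddot{Y} = \omega_n e^{-\zeta\omega_n t}\left\{\left[-C_1\zeta - C_2\sqrt{1-\zeta^2}\right]\sin\sqrt{1-\zeta^2}\,\omega_n t + \left[C_1\sqrt{1-\zeta^2} - C_2\zeta\right]\cos\sqrt{1-\zeta^2}\,\omega_n t\right\} \tag{10}$$

Fig. 2 indicates that, by symmetry, 

$$Y(0) = -Y(T), \qquad \dot{Y}(0) = -\dot{Y}(T), \qquad \ddot{Y}(0) = -\ddot{Y}(T) \tag{11}$$

Since X jumps from −X_max to +X_max at t = 0, the value of E 
must be zero at t = 0. Then, from the summing arrangement 

$$k_p Y^* - k_p Y - k_r\dot{Y} = 0 \tag{12}$$

or since Y* = 0 

$$k_p Y + k_r\dot{Y} = 0 \tag{13}$$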


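If Equations (8), (9), and (10) are now substituted into Equa- 
tions (11) and (13), a set of four equations in four unknowns is 
obtained. The solution of these equations leads to the following 
result: 

$$H = \frac{2\zeta\sinh\zeta\omega_n T + \dfrac{1-2\zeta^2}{\sqrt{1-\zeta^2}}\sin\sqrt{1-\zeta^2}\,\omega_n T - \dfrac{\omega_n T}{2}\left[\cosh\zeta\omega_n T + \cos\sqrt{1-\zeta^2}\,\omega_n T\right]}{\sinh\zeta\omega_n T - \dfrac{\zeta}{\sqrt{1-\zeta^2}}\sin\sqrt{1-\zeta^2}\,\omega_n T} \tag{14}$$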

In Fig. 3 this relation is compared to Equation (5), which is the 
corresponding result obtained by the describing function. The 
solid curve represents Equation (5) and the plotted points repre- 
sent Equation (14). Fig. 3 indicates that the describing function 
is an extremely good approximation to the exact solution. 
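
The comparison made in Fig. 3 can be repeated numerically. The sketch below is an illustration under stated assumptions: the describing-function relation is taken as (Tω_n/π)² = 1 − 2ζH, and the "exact" relation is a closed form re-derived here from Equations (1), (2), and (3) with the half-cycle symmetry conditions for 0 < ζ < 1. It is not transcribed from the paper's Equation (14) and may be arranged differently. The function names are hypothetical.

```python
import math


def H_describing_function(zeta, wT):
    """Assumed form of Eq. (5) solved for H, with wT = omega_n * T."""
    return (1.0 - (wT / math.pi) ** 2) / (2.0 * zeta)


def H_exact(zeta, wT):
    """Re-derived exact relation between H and wT = omega_n * T, valid for
    0 < zeta < 1 (an assumption of this sketch, not the paper's printed form)."""
    r = math.sqrt(1.0 - zeta * zeta)
    s, c = math.sin(r * wT), math.cos(r * wT)
    sh, ch = math.sinh(zeta * wT), math.cosh(zeta * wT)
    num = 2.0 * zeta * sh + (1.0 - 2.0 * zeta ** 2) / r * s - 0.5 * wT * (ch + c)
    den = sh - zeta / r * s
    return num / den


if __name__ == "__main__":
    for zeta, wT in [(0.5, math.pi / 2), (0.5, math.pi), (0.3, 2.0)]:
        print(zeta, round(wT, 3),
              round(H_describing_function(zeta, wT), 4),
              round(H_exact(zeta, wT), 4))
```

For the sample points shown, the two values of H differ by roughly 0.01 or less, and both approach 2ζH = 1 as ω_nT goes to zero, which agrees with the conclusion drawn from Fig. 3.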

The relation between ζ and H at Tω_n = 0 is, from Equation 
(14), 

$$2\zeta H = 1 \tag{15}$$

This result is identical to Equation (6) and thus indicates that 
the describing function is, in fact, “exact” at this limit. 

The analysis of this section has dealt with the “steady-state” 
nature of the limit cycle. This steady state is approached in some 
asymptotic fashion after a disturbance. Whenever 2ζH equals 
or exceeds unity the asymptotic value of Tω_n is zero. The greater 
the value of 2ζH, however, the faster this asymptotic value is 
approached. 

Stability Criterion for Bang-Bang System. The bang-bang system is 
always unstable in the sense that the output Y is continuously 
oscillating as described. The amplitude of this oscillation, however, 
goes to zero as the frequency of the oscillation becomes infinite. 
This frequency becomes infinite when Tω_n becomes zero, 
or in other words when 2ζH becomes unity. This fact is used 
to define a “stability criterion” for the bang-bang system as 
follows: 

The bang-bang system is “stable” if 2ζH ≥ 1. Stable here 
indicates that the ideal steady-state limit-cycle amplitude of Y is 
zero. For 2ζH < 1 the steady-state value of this amplitude is 
finite and thus the system is considered as “unstable.” 

Fig. 3 Comparison of describing-function solution to exact solution 

Real System. In a real system the limit cycle frequency does not 
approach infinity although 2ζH may be much greater than unity. 
This is due to the fact that the idealized model of the system 
shown in Fig. 1 becomes inapplicable at high frequencies and low 
oscillation amplitudes. 

If the limit cycle frequency is high compared to ω_n, however, the 
amplitude of oscillation of Y will be small. This fact is demon- 
strated by the analog computer results shown in Fig. 7. 


Simplified Analysis for Small Changes in Input Signal 

The analysis of nonlinear systems is complicated by the fact 
that, in general, the principle of superposition does not apply, and 
thus the response of a system to one input cannot ordinarily be 
determined from the response of that system to some other input. 
For this reason it is usually more difficult to describe the behavior 
of nonlinear systems than the behavior of linear systems. For the 
simple bang-bang system discussed in this paper, however, it is 
possible to make an approximate analysis which leads to a very 
simple but surprisingly accurate description of the dynamic 
behavior of the system if only small variations of the input signal 
are allowed. 

Fig. 4 Approximation to idealized bang-bang characteristic 

This simplified analysis is brought about by considering the 
nonlinear characteristic in Fig. 1 to be a limiting case of the 
characteristic, shown in Fig. 4, as k_c is made larger and larger. 
For small values of E, this system is linear and is described by the 
following equation: 

$$\left[D^3 + 2\zeta\omega_n D^2 + (\omega_n^2 + k_s k_c k_r)D + k_s k_c k_p\right]Y = k_s k_c k_p Y^* \tag{16}$$

Application of Routh’s stability criterion to the characteristic 
equation of Equation (16) indicates that the system is stable when 

$$\frac{2\zeta\omega_n(\omega_n^2 + k_s k_c k_r)}{k_s k_c k_p} > 1 \tag{17}$$

Since k_c is very large (ideally infinity), ω_n² is negligible in com- 
parison with k_s k_c k_r, and k_s k_c may be cancelled. Use of Equation 
(4) then leads to 

$$2\zeta H > 1 \tag{18}$$

This result is in exact agreement with the results obtained in the 
preceding section and serves to justify the use of this simple 
analysis. 
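
The Routh condition can be checked directly for a cubic characteristic equation. The sketch below assumes the characteristic polynomial s³ + 2ζω_n s² + (ω_n² + k_s k_c k_r)s + k_s k_c k_p, with the gain subscripts (k_s, k_c, k_p, k_r) used as illustrative labels for the servomotor, controller, proportional feedback, and rate feedback gains. It shows that at very high controller gain the test reduces to 2ζH > 1, while at low gain the ω_n² term can still make the loop stable.

```python
def cubic_is_stable(a2, a1, a0):
    """Routh test for s**3 + a2*s**2 + a1*s + a0: all coefficients
    positive and a2*a1 > a0."""
    return a2 > 0.0 and a1 > 0.0 and a0 > 0.0 and a2 * a1 > a0


def linear_loop_is_stable(zeta, H, k_c, omega_n=1.0, k_s=1.0, k_p=1.0):
    """Stability of the linearized loop for a finite controller gain k_c.
    k_r follows from the definition H = (k_r / k_p) * omega_n."""
    k_r = H * k_p / omega_n
    a2 = 2.0 * zeta * omega_n
    a1 = omega_n ** 2 + k_s * k_c * k_r
    a0 = k_s * k_c * k_p
    return cubic_is_stable(a2, a1, a0)


if __name__ == "__main__":
    print(linear_loop_is_stable(zeta=1.0, H=0.852, k_c=1e6))  # 2*zeta*H = 1.704
    print(linear_loop_is_stable(zeta=0.4, H=1.0, k_c=1e6))    # 2*zeta*H = 0.8
    print(linear_loop_is_stable(zeta=0.4, H=1.0, k_c=1.0))    # low gain
```

The third case illustrates why the limit of very large k_c is needed before the criterion depends on ζ and H alone.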

When the bang-bang system is stable (i.e., 2ζH > 1) small values 
of E are insured if only small changes of Y* are allowed. In this 
case, the approximate analysis again makes use of Equation (16). 
However, since k_c is very large (ideally infinite), the higher order 
terms are negligible (except at very high frequencies) and thus 

$$\left[k_s k_c k_r D + k_s k_c k_p\right]Y = k_s k_c k_p Y^* \tag{19}$$

Canceling and making use of Equation (4) 

$$\left[\frac{H}{\omega_n}D + 1\right]Y = Y^* \tag{20}$$

This equation suggests that when only small variations in Y* are 
allowed the behavior of the bang-bang system can be approxi- 
mated by a first-order lag with time constant equal to H/ω_n. 
This simple result appears to be quite meaningful since it is in 
good agreement with the results of the analog computer study 
discussed in the following sections. 
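
A rough digital illustration of this small-signal behavior is sketched below. It integrates the idealized relay loop with a forward-Euler scheme and provides the first-order lag of time constant H/ω_n for comparison. Everything here is an assumption made for illustration: the gain labels (k_s, k_p, k_r), the unit values chosen for k_s, k_p and X_max, the step size, and the integration method. It is not a reproduction of the analog computer study.

```python
import math


def simulate_bang_bang(step, zeta=1.0, H=0.852, omega_n=1.0, k_s=1.0,
                       x_max=1.0, k_p=1.0, t_end=20.0, dt=5e-4):
    """Forward-Euler simulation of the idealized bang-bang loop.
    Returns a list of (time, Y) samples for a step command of size `step`."""
    k_r = H * k_p / omega_n            # from H = (k_r / k_p) * omega_n
    y = v = a = 0.0                    # Y, dY/dt, d2Y/dt2, starting at rest
    history = []
    n_steps = int(round(t_end / dt))
    for i in range(n_steps):
        e = k_p * (step - y) - k_r * v             # error signal
        x = x_max if e > 0.0 else -x_max           # ideal relay
        # Servomotor: (D^2 + 2*zeta*omega_n*D + omega_n^2) D Y = k_s X
        jerk = k_s * x - 2.0 * zeta * omega_n * a - omega_n ** 2 * v
        y += v * dt
        v += a * dt
        a += jerk * dt
        history.append(((i + 1) * dt, y))
    return history


def first_order_lag(step, t, H=0.852, omega_n=1.0):
    """Step response of a first-order lag with time constant H / omega_n."""
    return step * (1.0 - math.exp(-t * omega_n / H))


if __name__ == "__main__":
    step = 0.05
    for t, y in simulate_bang_bang(step)[3999::4000]:
        print(round(t, 1), round(y / step, 3),
              round(first_order_lag(step, t) / step, 3))
```

The printed columns are time, the normalized simulated output, and the normalized first-order lag. The simulated output carries a small residual ripple from the discrete-time relay, so only the general trend and the final value should be compared.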


The preceding section indicates that the response of the bang- 
bang system to a small step input will be similar to the response of 
a first-order lag. This simple result, however, cannot be extended 
to describe the behavior for step inputs of larger amplitude. 
In these cases it is usually necessary to resort to numerical, 
graphical, or analog computer solutions. 

In order to establish some standard by which the response of 
the bang-bang system may be evaluated, a simple linear system 
is also considered. This linear system is identical to the bang- 
bang system in Fig. 1 except that the nonlinear characteristic is 
replaced with a proportional gain k_c. This linear system can be 
described by Equation (16) although for the present purposes k_c 
is not required to have a very large value. 

In order to permit a quantitative comparison of the step re- 
sponse behavior of the bang-bang and linear systems it is con- 
venient to establish some means by which the “speed of response” 
of the two systems may be compared. This is done by defining a 
“response time” T_r as: 


$$T_r = \frac{\text{Shaded area in Fig. 5}}{Y_0^*} \tag{21}$$

where 

T_r = response time 
Y_0* = commanded step amplitude 

The shaded area in Fig. 5 is equal to 

$$\int \left(\left|Y_0^* - Y\right| - e^{-\pi}\,Y_0^*\right)dt$$

where the integration is carried out over all time in which the in- 
tegrand is positive. 

This definition of the response time of the system is made arbi- 
trarily, but has the important advantages that the value of T_r 
may be obtained directly as a voltage signal from an analog com- 
puter and that this value will not be affected by the low-amplitude 
limit cycle oscillation of the bang-bang system. 

Fig. 5 Shaded area used in definition of T_r 

The value of T_r for a first-order lag is easily computed as fol- 
lows: Assume that a step of amplitude Y_0* is applied to a first- 
order lag which has a time constant τ. The output response of 
the system is then given by Y_0*(1 − e^(−t/τ)). Use of Equation 
(21) then leads to 

$$T_r = 0.821\,\tau \tag{22}$$
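
The factor 0.821 can be checked from the definition. For a first-order lag the integrand of Equation (21) is Y_0*(e^(−t/τ) − e^(−π)), which is positive for t < πτ, so the integral evaluates to τ[1 − (1 + π)e^(−π)]. The sketch below, with illustrative function names, compares this closed form with a simple numerical integration.

```python
import math


def response_time_first_order_lag(tau):
    """Closed form of T_r for Y = Y0 * (1 - exp(-t / tau)):
    T_r = tau * (1 - (1 + pi) * exp(-pi))."""
    return tau * (1.0 - (1.0 + math.pi) * math.exp(-math.pi))


def response_time_numeric(tau, dt=1e-4):
    """Left-Riemann integration of exp(-t/tau) - exp(-pi) while positive."""
    band = math.exp(-math.pi)
    total = 0.0
    t = 0.0
    while True:
        integrand = math.exp(-t / tau) - band
        if integrand <= 0.0:
            break
        total += integrand * dt
        t += dt
    return total


if __name__ == "__main__":
    print(round(response_time_first_order_lag(1.0), 4))
    print(round(response_time_numeric(1.0), 4))
```

Both routes give a value close to 0.821 for τ = 1, in agreement with Equation (22).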


The value of T_r for the linear system described by Equation 
(16) is clearly a function of the parameters appearing in that 
equation. Thus, 


T, = SE, for the linear system (23) 


For the idealized bang-bang system in Fig. 1 only the ratio of the 
rate and proportional feedback gains is important but since the 
system is nonlinear the value of T_r becomes a function of the in- 
put step amplitude. Thus: 

$$T_r = f_2(\zeta, \omega_n, H, Y_0^*, X_{\max}) \quad \text{for the bang-bang system} \tag{24}$$

These equations may be simplified by nondimensionalization. 
Thus: 

$$2\zeta\omega_n T_r = f_3(\zeta, K_c, K_r) \quad \text{for the linear system} \tag{25}$$

$$2\zeta\omega_n T_r = f_4(\zeta, H, y_0^*) \quad \text{for the bang-bang system} \tag{26}$$

where 

K_c = a dimensionless parameter representing k_c 
K_r = a dimensionless parameter representing k_r 
y_0* = a dimensionless parameter representing Y_0* 

Since ζ and ω_n are to be held constant in any comparison be- 
tween the linear and the bang-bang systems, it is equally valid to 
compare values of 2ζω_nT_r as to compare values of T_r directly. 

Equations (25) and (26) indicate that the response time of each 
system depends on three variables. A complete comparison under 
all conditions is therefore a rather involved procedure especially 
since not all three variables are the same in both cases. In order 
to simplify the procedure but still obtain the most interesting re- 
sults, it is convenient to make certain assumptions which lead to 
“minimum values” of 2ζω_nT_r in both Equations (25) and (26) as 
functions only of ζ. 

This minimum value represents the minimum value of 2ζω_nT_r 
which can be obtained by variations in only k_c and k_r subject to 
certain restrictions on the stability of the system. Perhaps the 
simplest restriction on the stability of a step response is a 
limit on the maximum allowable overshoot. Such a restriction is 
made by arbitrarily selecting the value of the maximum allowable 
overshoot to be e^(−π) Y_0* (i.e., about 4.32 per cent of the step ampli- 
tude). Thus, in terms of Fig. 5, any step response having shaded 
area above the commanded value is considered to violate the 
stability restrictions. 

For the linear system the minimum value of 2ζω_nT_r as a func- 
tion of ζ can be determined as follows: With k_r (or K_r) set equal 
to zero determine by trial of various values of k_c (or K_c) the 
minimum value of 2ζω_nT_r which the system can display without 
violating the overshoot restriction. This procedure has been 
carried out by means of analog computation and the results are shown 
in Fig. 6. 

It can be shown from Equation (16), however, that, if k_r (or 
K_r) can be increased from zero, the value of 2ζω_nT_r at any ζ can 
be reduced to the value shown in Fig. 6 for any lower value of ζ. 
Roughly speaking, this is due to the fact that increasing k_r has, 
to a certain extent, the same effect as lowering ζ. It is undesirable 
to reduce the response time too much by this method, however, 
since the responses corresponding to the low values of ζ 
tend to have quite oscillatory impulse response characteristics. 
The results of the analog study suggest that the response time 
should be reduced by the use of rate feedback k_r only if ζ is 
greater than 0.5. Below this value seems to be a point of diminishing 
returns since the response becomes quite oscillatory without 
much improvement in T_r. The minimum value of 2ζω_nT_r for 
all values of ζ greater than 0.5, therefore, will be considered to be 
the value at ζ = 0.5. This is shown in Fig. 6. 

Fig. 6 Response time for bang-bang and linear systems 

For the bang-bang system, the minimum value of 2ζω_nT_r can 
also be found at each ζ by trial of various values of H. As in- 
dicated by Equation (26), however, the value of the response time 
for the bang-bang system is a function of step amplitude. For 
purposes of comparison, the step amplitude for the bang-bang 
system is taken as the amplitude of the largest step which the 
(so-called) linear system could execute without saturation. This 
amplitude can be computed at each ζ from the parameters found 
in the determination of the minimum value of 2ζω_nT_r for the 
linear system. The minimum value of 2ζω_nT_r for the bang-bang 
system is shown as a function of ζ in Fig. 6. A typical step re- 
sponse is shown in Fig. 7. 

The variation of 2ζω_nT_r with step amplitude at the particular 
value of ζ = 1.0 is shown in Fig. 8. The step amplitude is non- 
dimensionalized by dividing the actual amplitude by the ampli- 
tude used in the determination of Fig. 6. 

At very small amplitudes the value of 2ζω_nT_r is seen to ap- 
proach the limit suggested by the small signal analysis of the pre- 
ceding section. Making use of Equations (20) and (22) 

$$T_r = 0.821\,\frac{H}{\omega_n}$$

$$2\zeta\omega_n T_r = 0.821\,(2\zeta H)$$

For the system of Fig. 8, ζ = 1 and H = 0.852. Thus 

$$2\zeta\omega_n T_r = 1.40$$

This limit seems to agree quite well with the data. 


Frequency Response 


The term “frequency response” in its usual sense has meaning 
only for linear systems and thus, strictly speaking, cannot be applied 
to the bang-bang system of Fig. 1. It is possible, however, 
to excite this system sinusoidally and to approximate the output 
by a sinusoid of the same frequency and thus determine an ap- 
proximate frequency response characteristic. At small input 
amplitudes and low frequencies the small signal analysis leading 
to Equation (20) indicates that the system should behave like a 
first-order lag with time constant H/ω_n. This approximation 
must break down as the frequency is increased, however, since 
the system must eventually appear to be third order. The actual 
behavior of the system was determined by analog computation 
and the results for ζ = 1.0 and H = 0.852 are shown in Fig. 9. 
In this figure the actual output amplitude (not amplitude ratio) is 
plotted as a function of the excitation frequency for several values 
of input amplitude. The shape of the curves is very much as 
predicted by the simple theory, as may be verified by noting that the 
amplitude of each curve at ω = ω_n/H has fallen to very nearly 
0.707 of the amplitude at zero frequency. At high frequency, 
however, the curves “jump” to a different mode in which X is 
simply a square wave of frequency equal to ω. The output 
characteristics are then simply the open-loop characteristics of 
the servomotor. A predicted curve of these characteristics can 
be obtained by approximating the square wave at X by its first 
Fourier component as is done in the describing function tech- 
nique. 
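
Both predicted curves can be written down in a few lines. The sketch below is an illustration under stated assumptions: the low-frequency curve is the amplitude ratio of a first-order lag with time constant H/ω_n, and the high-frequency curve applies the first Fourier component of a square wave, 4X_max/π, to the open-loop servomotor transfer function in the assumed form k_s/[(D² + 2ζω_nD + ω_n²)D]. The symbol k_s and the function names are labels chosen here, not taken from the paper.

```python
import math


def lag_amplitude_ratio(omega, H, omega_n=1.0):
    """Amplitude ratio of a first-order lag with time constant H / omega_n."""
    return 1.0 / math.sqrt(1.0 + (omega * H / omega_n) ** 2)


def open_loop_output_amplitude(omega, zeta, omega_n=1.0, k_s=1.0, x_max=1.0):
    """Output amplitude when X is a square wave of frequency omega, using
    only its fundamental component 4 * x_max / pi."""
    fundamental = 4.0 * x_max / math.pi
    quadratic = math.hypot(omega_n ** 2 - omega ** 2, 2.0 * zeta * omega_n * omega)
    return fundamental * k_s / (omega * quadratic)


if __name__ == "__main__":
    H = 0.852
    print(round(lag_amplitude_ratio(1.0 / H, H), 4))       # break frequency
    print(round(open_loop_output_amplitude(5.0, 1.0), 5))  # high-frequency mode
```

At ω = ω_n/H the first function returns 1/√2, the value quoted as 0.707 in the text.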

It is interesting to note that the approximation suggested by 
Equation (20) seems to be applicable even for large input signal 
amplitudes. (The input amplitude of 0.424 represents the larg- 
est amplitude which the linear system with ζ = 1.0 can execute 
without saturation.) 


Fig. 9 Frequency response of the bang-bang system: output amplitude 
versus frequency for four different input amplitudes, ζ = 1.0, H = 0.852 


Conclusions 


The results of this study indicate that a second-order rate-type 
servomotor can be controlled by means of the bang-bang scheme 
shown in Fig. 1. In many respects, this system can be considered 
as the limiting case of a linear system with very high gain in the 
forward loop. In particular, when only small changes in the 
input signal are allowed, the system may be approximated by a 
first-order lag. When sufficient rate feedback is employed, the 
steady-state limit cycle oscillation of this idealized bang-bang 
system occurs at infinite frequency and thus with zero amplitude 
of the servomotor output. The speed of response of the bang- 
bang system is of the same order of magnitude as that of a linear 
system for large input signals but somewhat faster for small input 
signals. 
The Design and Analysis of a Servovalve 
With Flow Feedback 


Almost all conventional servovalves operate on the principle of controlling the position 
of a four-way or three-way valve made with very high precision with the hope that the 
flow rate will follow accurately the input signal to the valve. In this paper, an entirely 
new approach to the servovalve design is presented. A flow signal instead of valve 
position is used in the feedback loop. This design eliminates the necessity of a high 
precision valve, gives a longer life expectancy of the valve, and minimizes 
the effect of load variation as well as supply pressure variation. Dynamic analysis and 
test results of the flowmeter and the complete valve are given. 


Introduction 


The function of a servovalve is to control the 
flow of fluid power to a fluid motor (actuator) in accordance with 
the input signal to the valve (often electric of very low power 
level). The flow of fluid power can be regulated by controlling 
either the output flow rate (flow-control valve) or the output 
pressure of the valve (pressure-control valve). For most posi- 
tion control systems, a flow-control valve is often the best choice. 
Systems using flow-control valves, rather than pressure-control 
valves, are less sensitive to loads acting on the actuator, and thus 
have less position error due to load changes. No existing valve 
is a perfect flow control or a perfect pressure control. There is 
always some coupling between the load pressure and load flow. 
Graphs showing the relationships between pressure, flow, and 
input signal are often called pressure flow characteristic curves of 
the valve. The shape of these curves influences both the static 
and the dynamic performance of the system. 

Ohio, March 29-April 2, 1959, of The American Society of Mechanical 
Engineers. 

Nomenclature 

a = distance between nozzle and flapper fulcrum, in. 
the upstream chamber cross-sectional area of the flowmeter, in.² 
effective nozzle area, in.² 
spool end area, in.² 
equivalent flapper damping coefficient, in-lb-sec/rad 
feedback spring torque arm, in. 
effective flowmeter damping coefficient, lb-sec/in. 
output spool discharge coefficient, dimensionless 
flowmeter and valve discharge coefficient (assumed to be constant), dimensionless 
derivative with respect to time (d/dt) 
output spool diameter, in. 
effective diameter of the flowmeter, in. 
flowmeter spring initial force, lb 
possible flowmeter friction force, lb 
steady-state flow force acting on the spool, lb 
transient flow force of the output-stage valve, lb 
nozzle force acting on flapper, lb 
feedback force, lb 
torque motor input differential current, amperes 
moment of inertia of the flapper and armature assembly, lb-sec²-in. 
flowmeter flow force coefficient, lb-sec²/in.⁴ 
torque motor coefficient, in-lb/amp 
torque motor magnetic spring rate, in-lb/radian 
composite flapper spring stiffness, in-lb/radian 
output stage transient flow force constant, lb-sec/in. 
flowmeter gain coefficient, sec/in.² 
feedback spring rate, lb/in. 
spool force coefficient, sec-lb^(1/2)/in.² 
spool steady-state flow force coefficient, lb-sec/in.³ 
output stage flow coefficient, in.²/sec 
armature and flapper spring rate, in-lb/radian 
nozzle pressure constant, in.⁵/sec-lb 
nozzle displacement gain, in.³/sec-radian 
power spool damping lengths, in. 
output spool mass, lb-sec²/in. 
effective mass of flowmeter, lb-sec²/in. 

(Continued on next page) 


The simplest type of a flow-control valve is a four-way spool- 
type valve. The spool can be either positioned directly by an 
electromagnetic actuator (commonly known as a torque motor) or 
by a hydraulic position servo. In both cases, the spool position is 
the controlled output. The flow and pressure output depend 
very much on the accuracy of the spool geometry (i.e., clearance 
between spool and bore, overlap or underlap of the metering 
corners, squareness of the metering lands, and radius of the meter- 
ing corners). These geometrical conditions also influence such 
things as leakage rate, deadband, linearity, and threshold. 
Another consideration in valve design is that response is improved 
as the necessary valve stroke, for a given flow, is reduced. That 
is, valve response will improve as the spool flow gain is increased, 
or for a given response, the power required to drive a spool, of a 
given mass and size, will be reduced if a short stroke (high gain) 
spool is used. But the use of a short stroke valve makes the abso- 
lute dimensional requirements of the valve very severe. For 
example, a typical high-performance 5-horsepower servovalve, 
operating on 3000-psi supply pressure, has about 0.015-in. stroke. 
The tolerance on the axial spacing of the metering corners must 
be less than 0.0001 in. This, and other critical dimensional 
requirements such as squareness of the metering edges with the 
axis, sharpness of the metering corners, have great bearing on the 
cost of the valve. Another problem is erosion. When the fluid 
flows past the metering corners at high velocity (over 300 mph 
for valves operating at 3000 psi supply pressure) small but hard 
particles in the fluid gradually erode the sharp corners and change 
the characteristics of the valve. 
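
The quoted jet velocity can be checked with the ideal orifice relation v = sqrt(2 Δp / ρ). The sketch below is only an illustration and is not part of the original analysis. It assumes an oil of specific gravity 0.85 and assumes that, at no load, each of the two metering orifices in series drops half of the 3000-psi supply pressure; the function names are hypothetical.

```python
import math

# Assumed fluid property (not from the paper): oil of specific gravity 0.85.
WATER_WEIGHT_DENSITY = 0.0361   # lb/in.^3, approximate
G = 386.1                       # in./sec^2
RHO_OIL = 0.85 * WATER_WEIGHT_DENSITY / G   # mass density, lb-sec^2/in.^4


def jet_velocity_in_per_sec(pressure_drop_psi, rho=RHO_OIL):
    """Ideal (loss-free) jet velocity through an orifice, in./sec."""
    return math.sqrt(2.0 * pressure_drop_psi / rho)


def in_per_sec_to_mph(v):
    """Convert in./sec to miles per hour."""
    return v * 3600.0 / (12.0 * 5280.0)


# Assumption: each metering orifice drops half of the 3000-psi supply.
v = jet_velocity_in_per_sec(1500.0)
print(round(in_per_sec_to_mph(v)), "mph")
```

Under these assumptions the result falls between 300 and 400 mph, which is consistent with the "over 300 mph" figure quoted above.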


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Fig. 1 The characteristic curves of a typical precision-made closed center 
four-way valve 


The measured characteristic curves of a typical precisely made
four-way valve with position input are as shown in Fig. 1.¹ It
is noted that the load flow is not only a function of the input signal
(displacement), but it is also a function of the load. Furthermore,
when the supply pressure changes, the whole family of
curves will change. Such characteristics can be detrimental to
the operation of the servo system. The variation of gain due to
change of load can be reduced by using a so-called flow gain compensation
technique. This is done by using a force input rather
than a position input. Under this condition, the position of the 
spool is equal to the input signal divided by the restoring spring 
rate. The latter is made up of two parts—the mechanical spring 
and an equivalent spring due to steady-state flow force. The 
shape of the characteristic curve depends on the relative value of 
the mechanical spring rates and the hydraulic spring rates. When 
the mechanical spring rate is reduced to about half the maximum 
hydraulic spring rate, the characteristic curves become almost 
horizontal over a wide range, and the valve approaches an ideal 


flow control. However, flow gain compensation has two major 
drawbacks: 


1 The flow force usually is quite low (for example, the maximum
flow force of a control valve with 10-horsepower output
operating at 3000-psi supply pressure is about 10 lb). A relatively
weak mechanical spring must therefore be used, and the force output
of the first stage has to be reduced accordingly. This reduces the
dependability of the two-stage valve, especially under dirty oil
conditions.

2 The hydraulic spring rate is a function of the supply pressure.
It is possible to select the mechanical spring rate for a given
supply pressure to get optimum compensation. However, when
the supply pressure changes, the compensation is no longer perfect.

¹ J. F. Blackburn, "Contribution to Hydraulic Control - 3, Pressure-Flow Relationship for Four-Way Valves," Trans. ASME, vol. 75, 1953, pp. 1163-1170.


Control Valve With Flow Feedback 


Since the flow is the ultimate variable to be controlled, the natu- 
ral thing to do is to measure the flow through the load and use this 
signal in the feedback path. The advantages of flow feedback 
are: 


1 The accuracy of the second-stage valve becomes less im- 
portant. Theoretically, if the loop gain is sufficiently high the 
static performance is not affected at all by the amount of overlap 
or underlap of the second-stage valve. Furthermore, the metering 
ports can be circular or any other shape instead of rectangular, 
thus reducing the cost of manufacturing. 

2 The effect of erosion is avoided, since the performance of
the valve is independent of the amount of underlap of the valve.

3 Since the flow is the controlled variable, the effect of load 
variation, supply pressure variation, and back pressure variation 
can be reduced to small amounts by using sufficient valve loop 
gain. That is, the valve gain will remain relatively constant al- 
though there are variations in the other components of the hy- 
draulic system. 

4 Valve leakage can be reduced by using an overlapped spool. 

5 The use of large spool overlaps permits the use of large 
diametric clearances without increased leakage. Large spool 
clearances are advantageous in systems using relatively dirty oil. 
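
The insensitivity claimed in items 1 and 3 is the usual property of a high-gain feedback loop: with forward gain G and feedback gain H, the static closed-loop gain is G / (1 + GH), which approaches 1/H as G grows. The following is a minimal sketch of that general relation, with hypothetical numbers; it is not a model of the particular valve described here.

```python
def closed_loop_gain(forward_gain, feedback_gain):
    """Static gain of a loop with forward gain G and feedback gain H."""
    return forward_gain / (1.0 + forward_gain * feedback_gain)


H = 1.0  # hypothetical flowmeter-plus-feedback-spring gain
# Hypothetical forward gains, e.g. before and after a supply-pressure drop
# that halves the gain of the output stage.
for G in (200.0, 100.0):
    print(G, closed_loop_gain(G, H))
```

With these values, halving the forward gain changes the closed-loop gain by about half of one per cent.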


Flowmeter and Feedback Arrangement 


It is obvious that the most important single element of the
system described is the flowmeter, which should meet the following
requirements:


1 Output signal should be a linear function of flow rate.

2 Good static response: low threshold and low hysteresis.

3 Fast dynamic response.

4 Calibration independent of viscosity of fluid.

5 Bi-directional sensitivity.

6 Mechanical output.

7 Simple and compact.


These requirements rule out all head-type meters, i.e., orifices, 
nozzles, venturis, pitot tubes, etc., since the output of these types 
of meters is proportional to the square of the input signal (flow 
rate). Their accuracy of flow measurement is very poor at low 
flows. However, in most systems greater accuracy is required 


Nomenclature (continued)

$P_d$ = valve drain pressure, lb/in.²
$P_f$ = pressure differential across the flowmeter, lb/in.²
$P_l$ = pressure drop across the load, lb/in.²
$P_s$ = valve supply pressure, lb/in.²
$P_1$, $P_2$ = nozzle chamber pressure, lb/in.²
$Q$ = flow output of the valve and the flowmeter, in.³/sec
$Q_0$ = flow at a given operating condition, assumed a constant, in.³/sec
$Q_1$, $Q_2$ = nozzle-flapper valve output flow, in.³/sec
$T_a$ = flexure torque of armature, flapper assembly, in-lb
$V$ = fluid velocity by metering edges of the power spool, in./sec
$x$ = output spool position, in.
$y$ = flowmeter displacement, in.
$y_0$ = flowmeter displacement at a given operating condition, constant, in.
$\theta$ = armature and flapper movement, radians
$\rho$ = mass density of fluid, lb-sec²/in.⁴

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at low flow rates than at the high flow conditions. Flowmeters 
using capillary-type restrictions have linear output but are sensi- 
tive to viscosity variations and consequently are very sensitive to 
temperature changes. 
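
The weakness of head-type meters at low flow follows directly from the square law. The short sketch below, with hypothetical function names, expresses both outputs as fractions of full scale.

```python
def head_meter_signal(flow_fraction):
    """Output of a head-type meter (orifice, nozzle, venturi), fraction of full scale."""
    return flow_fraction ** 2


def linear_meter_signal(flow_fraction):
    """Output of a meter whose signal is proportional to flow."""
    return flow_fraction


for q in (1.0, 0.5, 0.1, 0.01):
    print(q, head_meter_signal(q), linear_meter_signal(q))
```

At one tenth of full flow the head-type meter delivers only one hundredth of its full-scale signal, whereas a linear meter still delivers one tenth.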

The most suitable flowmeter for this application is the area 
type, a typical one of which is shown in Fig. 2. This type of flow 
sensing element satisfies all the requirements listed, except that it 
is only sensitive to flow in one direction. In order to measure
bi-directional flow, two area-type meters are required. The problem
then is how to sum the discontinuous outputs of both meters
smoothly, accurately, and without any deadband. This is
accomplished by using a force balance system as shown in Fig. 2.
The displacement outputs of the flowmeters are converted into
forces by two feedback springs. Both forces are fed back to
cancel the effect of the input force (output of torque motor). The
use of two area meters not only overcomes the unidirectional 
nature of the individual area flowmeters, but also provides a 
completely symmetrical arrangement; an important require- 
ment from the thermal stability point of view. The flapper and 
nozzle valve, shown in Fig. 3, serves as a high gain amplifier in a 
conventional manner. 


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Fig. 2 A typical area-type flowmeter 
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Fig. 3 Schematic diagram of the valve with flow feedback 
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Fig.4 Simplified block diagram of the servovalve with flow feedback 


Under normal operating conditions the force required to move 
the spool is relatively small. The flapper nozzle valve works almost
like an integrator. The approximate block diagram of the
system is as shown in Fig. 4. It is obvious that other types of
first-stage valves can also be used instead of the flapper nozzle 
arrangement as shown. 


Static and Dynamic Analysis of an Area-Type Flowmeter 


In making the analysis to be given, assumptions are made as 
follows: 


1 True flow source. In other words, it is assumed that the 
flow rate is not disturbed by the flowmeter. This assumption can 
be realized if the pressure drop across the meter is much less than 
the pressure drop across other parts of the flow circuit. 

2 Incompressible and frictionless fluid. In actual practice, if
the Reynolds number is reasonably high the viscosity effect will
be small. The metering portion of the flowmeter was designed to
have the characteristics of a true orifice.

3 The pressure force acting on the poppet is equal to the
pressure difference between the upstream and downstream cham- 
bers times the cross-sectional area of the upstream chamber. 
This assumption is justified when the flowmeter is so designed 
that the velocity of the fluid in the upstream chamber is small 
compared with the jet velocity at the opening formed between the 
poppet and its seat. Since the upstream velocity is small, the 
dynamic forces due to flow will also be small. 

4 The force in the flowmeter spring is constant over the full 
range of output displacement. This assumption is correct only 
when the precompression of the spring is very high compared with 
the variation of the force in the spring. 

5 The feedback spring has no effect on the flowmeter. In 
practice, the feedback spring force is much smaller than the 
flowmeter spring force. 


Referring to Fig. 2, the equation of motion is

    $M_f \ddot{y} + C_f \dot{y} + F + f = P_f A_f$    (1)

where

$M_f$ = total mass of the moving parts plus the effective mass
of fluid which is pushed around when the flowmeter
poppet moves
$C_f$ = effective damping coefficient
$F$ = initial spring force
$f$ = possible friction force at the seal
$P_f$ = pressure differential between the upstream and downstream
chambers of the flowmeter
$A_f$ = cross-sectional area of the upstream chamber
$y$ = displacement of the flowmeter

The Bernoulli equation for flow is 
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Fig. 5 Static characteristics of single area-type flowmeter 


    $Q = C_d \pi d_f y \sqrt{2 P_f / \rho}$    (2)

where

$Q$ = flow rate passing through the flowmeter
$d_f$ = diameter of the upstream chamber
$C_d$ = discharge coefficient which is assumed to be constant
$\rho$ = density of the fluid which is assumed to be constant


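Combining Equations (1) and (2)

    $y = \frac{Q}{C_d \pi d_f} \sqrt{\frac{\rho A_f}{2}} \, \frac{1}{\sqrt{M_f \ddot{y} + C_f \dot{y} + F + f}}$    (3)

The steady-state response of the flowmeter is obtained by
setting $\dot{y}$ and $\ddot{y}$ = 0, then

    $y = K_{fm} Q$    (4)

where

    $K_{fm} = \frac{1}{C_d \pi d_f} \sqrt{\frac{\rho A_f}{2 (F + f)}}$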

The relationship between Q and y can be plotted as shown in
Fig. 5. The important thing to note is the effect of possible friction.
The percentage error of the flowmeter due to the friction
(generally defined as the ratio of the deviation of the output and 
the maximum output) is smaller at small flows. When the flow 
approaches zero so does the percentage error. Thus the flowmeter is 
a very accurate device at small flow conditions. When two flow- 
meters are used together, the hysteresis curve will look like Fig. 6 
instead of the more conventional type hysteresis curve as shown 
in Fig. 7. This behavior is fortunate because in servo systems 
small flow accuracy is usually of the utmost importance. 
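
The behavior of the friction error can be illustrated numerically. The sketch below assumes the steady-state relation takes the form y = (Q / (C_d π d)) sqrt(ρA / (2(F ± f))), with the friction force f adding to or subtracting from the spring force according to the direction of motion. All parameter values and function names are hypothetical and chosen only for illustration.

```python
import math


def flowmeter_displacement(q, spring_force, friction,
                           cd=0.6, d=0.5, area=0.2, rho=8.0e-5):
    """Steady-state displacement y, in.; friction may be positive or negative.

    Hypothetical defaults. Units: q in in.^3/sec, forces in lb, d in in.,
    area in in.^2, rho in lb-sec^2/in.^4.
    """
    return (q / (cd * math.pi * d)
            * math.sqrt(rho * area / (2.0 * (spring_force + friction))))


def friction_error_fraction(q, q_max, spring_force, friction):
    """Width of the friction band at flow q as a fraction of full-scale output."""
    band = (flowmeter_displacement(q, spring_force, -friction)
            - flowmeter_displacement(q, spring_force, +friction))
    full_scale = flowmeter_displacement(q_max, spring_force, 0.0)
    return band / full_scale


for q in (30.0, 15.0, 3.0, 0.0):
    print(q, friction_error_fraction(q, 30.0, spring_force=20.0, friction=0.5))
```

Because the width of the band is proportional to the flow, the error expressed as a fraction of full scale shrinks in proportion to the flow and vanishes at zero flow.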

Since the differential equation, Equation (3), is nonlinear, the 
dynamic performance of the flowmeter can best be revealed by 
investigating the behavior due to small perturbations. Equation 
(3) can be rewritten as 


    $K \frac{Q^2}{y^2} = M_f \ddot{y} + C_f \dot{y} + F + f$    (4a)


where 
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Fig. 6 Static characteristics of two area-type flowmeters with force 
summing 
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Fig. 7 Normal hysteresis curve 


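    $K = \frac{\rho A_f}{2 C_d^2 \pi^2 d_f^2}$


For small perturbations about a given steady-state condition of
$Q_0$ and $y_0$

    $-2 K Q_0^2 y_0^{-3} \Delta y + 2 K y_0^{-2} Q_0 \Delta Q = M_f \Delta\ddot{y} + C_f \Delta\dot{y}$    (5)

where $\Delta y$ and $\Delta Q$ indicate variation in y and Q, but for
steady-state conditions

    $K \frac{Q_0^2}{y_0^2} = F + f$    (6)

Substituting Equation (6) into Equation (5)

    $M_f \Delta\ddot{y} + C_f \Delta\dot{y} + \frac{2 (F + f)}{y_0} \Delta y = \frac{2 (F + f)}{Q_0} \Delta Q$

the natural frequency of the flowmeter is thus

    $\frac{\omega_n}{2\pi} = \frac{1}{2\pi} \sqrt{\frac{2 (F + f)}{y_0 M_f}}$    (7)

The damping ratio is

    $\zeta = \frac{C_f}{2} \sqrt{\frac{y_0}{2 (F + f) M_f}}$    (8)
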
From Equation (7), it is seen that the natural frequency of the 
flowmeter due to small perturbation depends on the initial flow 
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Fig. 8 Flowmeter frequency versus signal level 
condition. For very small flows, if the flowmeter is disturbed, 
the natural frequency will be very high. According to Equation 
(7), when the flow approaches zero, the natural frequency ap- 
proaches infinity. This obviously cannot be the case. At low 
flows, the assumption that fluid friction and compressibility can 
be neglected is not valid. Therefore, the relationship between 
the natural frequency and flow level is somewhat as shown in Fig. 
8. The minimum frequency corresponds to the maximum flow
condition. For a typical flowmeter with a flow capacity of 7 gpm
and operating with a pressure drop of 120 psi, the calculated
minimum natural frequency is 1200 cps.
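
The dependence of the natural frequency on operating point can be sketched as follows. The code assumes the small-perturbation result takes the form ω_n = sqrt(2(F + f) / (y_0 M)), with an equivalent spring rate 2(F + f) / y_0, so that the damping ratio is C / (2 sqrt(kM)). The numerical values and function names are hypothetical and are not taken from the paper.

```python
import math


def natural_frequency_cps(spring_force, friction, y0, mass):
    """omega_n / (2*pi), with omega_n = sqrt(2 (F + f) / (y0 * M))."""
    return math.sqrt(2.0 * (spring_force + friction) / (y0 * mass)) / (2.0 * math.pi)


def damping_ratio(damping, spring_force, friction, y0, mass):
    """C / (2 sqrt(k M)), where k = 2 (F + f) / y0 is the equivalent spring rate."""
    stiffness = 2.0 * (spring_force + friction) / y0   # lb/in.
    return damping / (2.0 * math.sqrt(stiffness * mass))


# Hypothetical numbers: F and f in lb, M in lb-sec^2/in.
F, f, M = 20.0, 0.0, 1.0e-4
for y0 in (0.08, 0.04, 0.01):   # in.; a smaller y0 corresponds to a smaller flow
    print(y0, round(natural_frequency_cps(F, f, y0, M)))
```

Reducing the operating displacement by a factor of four doubles the computed frequency, which is why the lowest frequency occurs at maximum flow.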


Discussion of the Valve System 


A four-way, two-stage, flow-control valve with an electrical 
input was developed by the authors. This design utilized a 
torque motor to convert electrical signals into a torque which 
acted on the flapper of a nozzle-flapper-type first-stage valve. 
Output from the first-stage valve drives a spool-type output stage 
valve. A torque proportional to the valve flow output is fed 
back to the flapper by the flowmeter. Fig. 3 schematically il- 
lustrates the working relationship of the valve components. Fig. 
9 illustrates the dimensions used in the following valve analysis. 

As designed, maximum output flow is attained with an 8 milli- 
ampere input signal. Valve gain is easily varied from 1 gpm per 
milliampere to 0.12 gpm per milliampere, by changing the rate 
of the feedback spring. 
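
The effect of the feedback spring on gain can be sketched under a simplifying assumption that is not stated in this form in the paper: if the loop gain is high, the steady-state torque balance on the flapper reduces approximately to (torque motor coefficient) × I ≈ b × (feedback spring rate) × (flowmeter gain) × Q, neglecting the nozzle and flow-force terms. The static gain Q/I is then inversely proportional to the feedback spring rate. The names `kt`, `b`, `kf`, and `kfm` below are hypothetical.

```python
def static_gain(kt, b, kf, kfm):
    """Approximate static gain Q/I = kt / (b * kf * kfm), high loop gain assumed."""
    return kt / (b * kf * kfm)


def spring_rate_for_gain(target_gain, kt, b, kfm):
    """Feedback spring rate giving the target static gain under the same assumption."""
    return kt / (b * kfm * target_gain)


# The spring-rate ratio needed to go from 1 to 0.12 gpm per milliampere
# does not depend on the placeholder values chosen for kt, b, and kfm.
kt, b, kfm = 1.0, 1.0, 1.0
ratio = spring_rate_for_gain(0.12, kt, b, kfm) / spring_rate_for_gain(1.0, kt, b, kfm)
print(round(ratio, 2))
```

Under this approximation, the quoted gain range corresponds to roughly an eightfold range of feedback spring rate.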

Determination of the characteristics of this flow control 
valve can be made by a mathematical analysis. In the following 
paragraphs, the characteristics of the valve components are dis- 
cussed and described mathematically. The equations are then 
combined into an expression which relates the action of the in- 
dividual components in one equation. 

1 Torque Motor. The subject valve was designed so that the 
torque-motor deflection is relatively small, resulting in an out- 
put torque which is very nearly a linear function of both the dif- 
ferential current and the armature deflection.? Elastic deforma- 
tion of the armature and flapper assembly was found to be 
negligible; therefore, the angular flapper displacement is as- 
sumed to be the same as the angular armature displacement: 


    $T_m = K_t I - K_m \theta$    (9)

where $K_m$, the magnetic spring rate, is a negative quantity.
² R. H. Frazier and R. D. Atchley, "A Permanent-Magnet-Type Electric Actuator for Servovalves," Dynamic Analysis and Control Laboratory, Report No. 66, Massachusetts Institute of Technology, Cambridge, Mass., June, 1952.

Fig. 9 Schematic diagram of valve with flow feedback


2 Feedback Forces. As discussed in another section of this
paper, the natural frequency of the flowmeter can be very high.
The natural frequency of the flowmeters used is in the order of
1200 cps (at an 8-gpm flow rate), which is considerably higher than
the natural frequency of the whole valve. Therefore, the flowmeter
dynamics can be neglected and the flowmeter displacement
can be expressed as previously in Equation (4). Feedback force
will be:

    $F_f = K_f (b\theta + y) = K_f b \theta + K_f K_{fm} Q$    (10)

3 Flapper Flexure Torque. This expression sums the effects of all
the mechanical springs acting on the flapper-armature assembly
other than the flowmeter springs. Flexure torque is given by:

    $T_a = K_a \theta$    (11)


4 Nozzle Reactive Force. When the distance of the flapper 
from nozzle is less than '/;») the nozzle diameter, forces due to the 
approach velocity amount to only a few per cent of the total 
flapper force and can be neglected. The forces due to nozzle 
pressures can then be stated as: 


    $F_n = A_n (P_1 - P_2)$    (12)


5 First-Stage Pressure Flow Characteristics. The pressure flow
characteristics of the first stage are as described by Blackburn¹
and Shearer.³ For small perturbations the flow output can be
characterized by the following linearized equations:

    $Q_1 = -Q_2 = K_\theta \theta - K_1 (P_1 - P_2)$    (13)

If fluid compressibility is neglected, the first stage output can
be related to output stage velocity by:

    $Q_1 = -Q_2 = A_s D x$    (14)

6 Second-Stage Characteristics. Assuming a perfect second-stage
valve (i.e., perfect porting edges, zero lap, and no leakage) and
for small spool displacements:

    $Q = C_d \pi d_s x \sqrt{\frac{P_s - P_l - P_f - P_d}{\rho}}$    (15)

Since the flowmeter pressure drop and supply pressure are
constant and assuming a no-load condition ($P_l = 0$) and zero
drain pressure, the flow equation can be written:

    $Q = K_q x$    (16)

where

    $K_q = C_d \pi d_s \sqrt{\frac{P_s - P_f}{\rho}}$

³ J. L. Shearer, "Dynamic Characteristics of Valve-Controlled Hydraulic Servomotors," Trans. ASME, vol. 76, 1954, pp. 895-903.


As discussed by Lee and Blackburn,⁴ a steady-state force is
exerted on the output spool by the fluid flowing past the metering
edges. This force can be expressed by:

    $F_s = 2 \rho Q V \cos 69^\circ$    (17)

For zero load and drain pressures, a constant flowmeter drop
and a constant supply pressure:

    $F_s = K_s Q$    (18)

where

    $K_s = 2 \cos 69^\circ \sqrt{\rho (P_s - P_f)}$
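
As a rough numerical illustration of the steady-state flow force, the sketch below evaluates F = KQ with K = 2 cos 69° sqrt(ρ(P_s − P_f)), the form that follows when the jet velocity is taken as sqrt((P_s − P_f)/ρ) at zero load and drain pressure. The oil density is an assumed value, and the combination of 8 gpm, 3000 psi, and a 120-psi flowmeter drop is chosen only for illustration; it does not correspond to a reported test point.

```python
import math

JET_ANGLE_DEG = 69.0  # jet angle appearing in the flow-force expression


def steady_state_flow_force(q, supply_psi, flowmeter_drop_psi, rho=8.0e-5):
    """Flow force in lb for zero load and drain pressure.

    q in in.^3/sec, pressures in lb/in.^2, rho in lb-sec^2/in.^4 (assumed oil value).
    """
    k = (2.0 * math.cos(math.radians(JET_ANGLE_DEG))
         * math.sqrt(rho * (supply_psi - flowmeter_drop_psi)))
    return k * q


# 8 gpm converted to in.^3/sec (231 in.^3 per gallon).
q = 8.0 * 231.0 / 60.0
print(round(steady_state_flow_force(q, 3000.0, 120.0), 1), "lb")
```

With these assumed inputs the force is on the order of 10 lb, the same order as the figure quoted earlier for a 3000-psi valve.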


A transient force also exists on the power spool.’ This force 
can be expressed by: 


Fy = — V(P, — Py — P, — P,)p dz/dt 


Or for zero load and drain pressures and for a constant flow- 
meter drop and a constant supply pressure: 


where 


7 Summation of Torques on Flapper. Since the friction forces
acting on the flapper are negligible and considering Newton's
second law, the torques acting on the flapper can be summed
as follows:

    $T_m = J D^2 \theta + B D \theta + F_f b + T_a + a F_n + K_f b^2 \theta$    (20)

By substituting Equations (9), (10), (11), and (12) into Equation
(20) and collecting like terms:

    $(J D^2 + B D + K_c)\theta = K_t I - a A_n (P_1 - P_2) - b K_f K_{fm} Q$    (21)

where

    $K_c = 2 b^2 K_f + K_a + K_m$


8 Summation of Spool Forces. When Newton's second law is applied,
all the forces acting on the power spool can be summed as:

    $A_s (P_1 - P_2) = M_s D^2 x + K_{tr} D x + K_s Q$    (22)

⁴ S.-Y. Lee and J. F. Blackburn, "Contribution to Hydraulic Control - 1, Steady-State Axial Forces on Control Valve Pistons," Trans. ASME, vol. 74, 1952, pp. 1005-1011.

⁵ S.-Y. Lee and J. F. Blackburn, "Contributions to Hydraulic Control - 2, Transient Flow Forces and Valve Instability," Trans. ASME, vol. 74, 1952, pp. 1013-1016.


9 Final Dynamic Equation. Equations (13), (14), (16), 
(21), and (22) can be combined into one equation which relates 
the flow output of the valve to small changes in the input signal: 


tie 


£14, & A, K, 
K.KiK, . aA,K, 
+ A, + Kuk |} Q, = (23) 


In the valve described by this paper the forces available to 
drive the power spool were very large compared to the weight of 
the spool. Therefore, the effects of the spool mass can be neg- 
lected. Since the dynamic operation of the valve is determined 
by the left-hand side of Equation (23), the characteristic equation 
describing the valve’s dynamic operation is obtained by setting 
the right-hand side of Equation (23) equal to zero: 


J (Ki Ke 


J KK, , B (Ki Kz 
Ee + Ki K, aA,, D 
A, \ A, 


K 
K.KiK, . aA,K, 
+ [ + A + | 0 (24) 


By applying Routh’s criterion to this equation and rearranging 
terms, it can be determined that the valve will be stable if the fol- 
lowing inequality is satisfied: 


K 


K,A,; K, A, 
+ ( 4.) (25) 


From this equation, it is seen that the flapper damping coeffi- 
cient B is a very important factor for stable operation. In prac- 
tice, part of this damping comes from the eddy current losses in 
the torque motor and part comes from the viscous loss in the 
fluid in which the flapper is immersed. 
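
For reference, Routh's criterion for a cubic is simple to state. Assuming the characteristic equation is a cubic in D once the spool mass is neglected, it has the form a3 D³ + a2 D² + a1 D + a0 = 0, and all roots lie in the left half plane if and only if every coefficient is positive and a2 a1 > a3 a0. The sketch below applies this general test to two made-up polynomials; the coefficients are not those of the valve.

```python
def cubic_is_stable(a3, a2, a1, a0):
    """Routh's criterion for a3*D^3 + a2*D^2 + a1*D + a0 = 0.

    Stable if and only if every coefficient is positive and a2*a1 > a3*a0.
    """
    return min(a3, a2, a1, a0) > 0 and a2 * a1 > a3 * a0


# (D + 1)(D + 2)(D + 3) = D^3 + 6 D^2 + 11 D + 6
print(cubic_is_stable(1.0, 6.0, 11.0, 6.0))
# (D + 2)(D^2 - D + 3) = D^3 + D^2 + D + 6, which has right-half-plane roots
print(cubic_is_stable(1.0, 1.0, 1.0, 6.0))
```

In this form, a larger coefficient on the D² or D term makes the inequality easier to satisfy, which matches the observation that damping is important for stable operation.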

The steady-state output can be obtained from Equation (23) if 
all the derivatives in the equation are made equal to zero, but the 
resulting equation will not show the effects of varying the load or
supply pressures. To determine the effects of pressure changes 
on flow output under steady-state conditions: 

Change Equation (13) to read

    $0 = K_\theta \theta - K_1 (P_1 - P_2)$    (26)
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F, = K,Dz (19) 


Change Equation (18) to read

    $F_s = K_{sf} Q \sqrt{P_s - P_l - P_f}$    (27)

where

    $K_{sf} = 2 \cos 69^\circ \sqrt{\rho}$

Change Equation (20) to read

    $T_m = F_f b + T_a + a F_n + K_f b^2 \theta$    (28)

Change Equation (22) to read

    $A_s (P_1 - P_2) = K_{sf} Q \sqrt{P_s - P_l - P_f}$    (29)


Combining Equations (26), (27), (28), (29), and (30) in a manner 
analogous to determining the valve’s dynamic characteristics the 
following equation can be obtained: 


bK tm K,A, A, 


(30) 
1+K, 


In practice 


K, VP, =£> <1 


2K,A, A, 31) 


= 
a 
af 


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and the valve output can be expressed as: 


K, 


A] —— 
bK eK to 


(32) 


The existence of the left-hand term of Equation (31) in Equation
(30) does make the valve's output slightly sensitive to load-pressure
changes. Complete insensitivity to load-pressure changes
could be attained by compensating for the spool flow forces, or the
sensitivity could be reduced by using a first stage which is insensitive
to the spool flow forces (this would be equivalent to $a A_n = 0$).
Adequate pressure insensitivity was attained by controlling the
coefficients of Equation (30).


Test Results 


The following paragraphs summarize the performance of valves 
using flow feedback as determined by test data. 

Static data are shown in curves of Figs. 10 through 13. Curves 
10 and 11 show the relationship between input signal and total 
valve flow (load flow plus first-stage flow plus internal leakage). 
Load pressure for these data was zero. Note that there is prac- 
tically no change in gain between curves 10 and 11 even though 
the supply pressure difference is two to one. The flattening of 
the 1000-psi curve at large signals indicates flow saturation due 
to the limited capacity of the output stage. Hysteresis at 2000
psi is 2½ per cent. Losses in the torque motor account for 1½
per cent of the hysteresis. The remaining one per cent hysteresis
is attributable to mechanical friction. The null flow is mostly 
from the flapper-nozzle valve. 

Fig. 12 shows the pressure gain of the valve when the load ports 
are blocked. It is seen that it takes about 1 per cent of the 
maximum signal input to cause the load pressure to switch com- 
pletely. 

Fig. 13 is the complete pressure flow and input characteristics. 


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Fig. 12 Pressure gain of the valve—blocked load ports test 
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Fig. 14 Dynamic response of valve 


Fig. 13 Pressure flow characteristic curves of the valve 


It should be noted that in a very large region the valve behaves 
like a perfect flow controller. When the signal is too high or 
when the load is too great, the flow drops due to the limited capac- 
ity of the output stage. 

Fig. 14 shows the frequency response of the system.
These data relate the magnitude and phase lag of the valve output 
flow to the input current. The input to the valve was a sinusoidal 
current with a total amplitude of 4 milliamperes. 


Conclusions 


After conducting innumerable tests, a small portion of which is
shown here, the authors are convinced that the flow feedback
valve is practical. One possible disadvantage is the added com- 
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plication of the flowmeters. But since all the added parts such 
as the flowmeter poppets, check valves, and springs are non- 
precision parts, and because of the proven reliability of the flow- 
meters, it is the opinion of the authors that the added advantages 
more than offset the added complications. 
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A Thermocouple System for Measuring 
Turbine-Inlet Temperatures 


A fast-response, unique thermoelement combination of platinum-15 per cent iridium and
palladium, whose emf output is about three fourths that of Chromel/Alumel at 2000 F
(1093 C), has been found reliable from 1800 to 2300 F (982 to 1260 C) for more than
400 hours in a combustion atmosphere. A new, stable nickel-4.3 per cent silicon
matching-lead-wire alloy was prepared for use up to an ambient temperature of 1400 F
(760 C). Additional tests provided the best choice of insulation, thermocouple dimensions,
sheath material, connectors, and harness design for aircraft jet engines. While it has
heretofore been impractical or very difficult to determine turbine-inlet temperatures
above 1600 F (871 C), this system is capable of measuring directly the most critical of
all operational variables and accordingly allows safe potential increases in engine
performance.


Introduction 


In the testing and operation of turboprop
and turbojet aircraft engines, gas-temperature measurements are 
required to determine performance and to protect vulnerable 
machine parts, such as turbine buckets, from premature failure. 
For both functions, it has been customary to position several 
Chromel/Alumel thermocouples either in the engine exhaust gas 
stream or occasionally in the turbine inlet—provided that about 
1600 F is not exceeded. Depending on operating conditions, the 
lower exhaust and the higher turbine-inlet temperatures may or 
may not be directly related; hence, large safety factors are 
usually applied for adequate protection. Since real knowledge of 
turbine-inlet temperature permits greater efficiency and reduces 
safety hazards, it should be measured with a direct sensing sys- 
tem rather than by inference. The advantages are many, e.g., 
an aircraft operating at a Mach number of one would realize a 
thrust gain of more than 30 per cent by changing turbine-inlet 
temperature, if known accurately, from 1550 to 2000 F (843 to 
1093 C). 

The purpose of this paper is to report the development and 
evaluation of a thermocouple system for preturbine installation 
where the probe tips may reach steady-state temperatures in the
range of 1800 to 2300 F, with transients up to 2500 F (1371 C), 
and the sheath or support material may be subjected to a maxi- 
mum of 2100 F (1149 C). Other desired features were that: 


1 The emf deviation of the thermoelements shall not exceed
±0.50 per cent during 400-hour exposure to combustion gases
flowing up to a Mach number of eight tenths.

2 The over-all system accuracy shall be within ±0.75 per cent
when the ambient temperature of the harness does not exceed
1400 F.

3 A high thermoelectric emf in excess of 22 millivolts at
2000 F shall be available.

4 No special cooling provisions shall be required.

5 Prime emphasis shall be placed on system reliability and 

Contributed by the Instruments and Regulators Division and presented
at the Instruments and Regulators Conference, Cleveland,
Ohio, March 22-April 2, 1959, of The American Society of Mechanical
Engineers.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, October
10, 1958. Paper No. 59-IRD-1.
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freedom from excessive high-temperature oxidation and mechani- 
cal failure throughout normal engine life. 


Experimental Procedure and Results 


The text following is organized into logical sections describing 
component parts (thermoelements, lead wire, insulation, sheath 
material, connectors, and harness) and characteristics (time re- 
sponse) of the over-all system. 


Thermoelements 


The need for good oxidation resistance dictated that the higher- 
melting noble metals—rhenium, rhodium, platinum, palladium, 
and their various alloys—be considered first. On reviewing pub- 
lished tables of thermoelectric-emf data, such as those prepared 
by Roeser and Wensel [1],¹ the proposal was made that a high
signal output might be provided by the combination of a plati- 
num-iridium alloy versus palladium. The platinum-iridium 
alloys offer a high positive thermoelectric emf; of the noble 
metals, only palladium is negative to platinum. Although Barber 
and Pemberton [2] found that palladium possesses excellent sta- 
bility when placed in combination with silver from 400 to 1100 F 
(204 to 593 C) there have been few other applications of this 
thermoelement material. While it is known that palladium oxi- 
dizes superficially at about 1300 F (704 C), the oxide decomposes 
above 1600 F to leave bright metal useful for this particular ap- 
plication. 

To simulate preturbine atmospheres, a combustion chamber 
was designed and operated on Bunker-two fuel oil containing 0.4 
per cent sulfur. Exposure temperatures were maintained to 
±25 F in the 2000 to 2500 F range and to ±50 F in the 2500 to
2700 F range. The per cent emf deviation at 2000 F was de- 
termined for 15 different thermoelement combinations by noting 
the change from initial calibration, against a well-standardized 
platinum-10 per cent rhodium/platinum thermocouple, at ex- 
posure-time intervals of 25 or 50 hours. Representative stability 
curves are shown in Fig. 1; a summary of results is given in Table 
1. Although prepared as rods surrounded by insulation and an 
outer concentric tube of palladium, the tungsten, titanium, 
Chromel, nickel-18 per cent molybdenum, and platinum-clad 
tungsten thermoelements all oxidized and failed within 50 hours. 
Thermoelectric-emf curves were determined up to 2000 F for all 
thermoelements; a few of these are compared in Fig. 2 to the 


1 Numbers in brackets designate References at end of paper. 
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Fig. 1 Representative stability curves for four thermoelement combina- 


tions exposed to combustion atmospheres at 2300 and 2400 F 


Table 1 Results of combustion chamber testing

                                                          Maximum time, hours
                                          Maximum         To exceed ±0.5%   For
Thermoelements*                           temperature,    emf deviation     failure
                                          deg F
Tungsten/palladium                        2400            18-               18
Titanium/palladium                        2200            25-               25
Chromel/palladium                         2300            50-               50
Nickel-18% molybdenum/palladium           2300            50-               50
Platinum-clad tungsten/palladium          2400            50-               50
Platinum-15% iridium/platinum             2400            50                150-
Platinum-10% iridium/platinum             2400            150               200-
Iridium-60% rhodium/iridium†              2400            400+              400+
Platinum-15% iridium/palladium            2300            400+              400+
Platinum-10% iridium/palladium            2300            600+              600+
Iridium-50% rhodium/iridium-10% rhodium   2400            600+              600+
Platinum/palladium                        2400            600+              600+
Rhodium/palladium                         2400            600+              600+
Platinum-20% rhodium/palladium            2400            600+              600+
Platinum-10% rhodium/platinum             2700            800+              800+
Platinum-13% rhodium/platinum             2700            800+              800+

* The notation used is that the more negative thermoelement be
given last.

† Iridium almost completely volatilized after 312 hours.


output characteristics of Chromel/Alumel. Unfortunately, the 
most stable systems tested, e.g., platinum-13 per cent rhodium/platinum
and rhodium/palladium, tend to have a low signal output.
The platinum-15 per cent iridium/palladium combination, 
whose output is about three fourths that of Chromel/Alumel at 
2000 F, also has acceptable stability and was chosen as the most 
suitable compromise. 

Another advantage of using palladium as a thermoelement is 
that it is only about one fifth, one seventh, and one thirteenth as 
expensive as rhodium, platinum, and iridium, respectively [3]. 
Development materials were chemically pure, i.e., grade two in 
the platinum grading system, and possessed a variability of 
±0.35 per cent in thermoelectric emf between lots. Acknowledging
that this degree of inaccuracy is present until platinum-15 
per cent iridium/palladium are qualified as standard thermoele- 
Table 2 Corresponding values of temperature and emf for platinum-15
per cent iridium/palladium thermocouples(a)

(Reference junction at 32 F)

Temperature              emf output in
deg F      deg C         millivolts
 250        121            2.29
 500        260            5.38
 750        399            8.94
1000        538           12.94
1250        677           17.39
1500        816           22.25
2000       1093           33.12

For over-all system(b), using Nichrome/nickel-4.3 per cent silicon lead wire

Temperature              emf output in
deg F      deg C         millivolts
 -83        -64           -1.12
 -39        -39           -0.70
 1.0                      -0.31
  84         29            0.58
 121         51            0.94
 158         70            1.30
 250        121            2.32
 500        260            5.46
 750        399            9.06
1000        538           13.04
1250        677           17.46

(a) Thermoelements consist of 16-gage wire butt welded at an open
junction.
(b) The thermoelement-lead wire junction was located six inches
from the open end and subjected to increasing ambient temperatures
during calibration.



ments, their thermoelectric calibrations are given in Table 2, without
and with matching lead wire. Since the lead-wire-thermoelement
junction was at a temperature of 1714 F (933 C) when the test-probe
tip measured 2000 F, there is a difference of -1.5 per cent between
the thermoelements alone and the over-all system. When the
lead-wire-thermoelement junction temperature is less than 1400 F,
as it can be by design and is for lower test-probe tip temperatures,
the difference is within the desired ±0.75 per cent. 
Lead Wire 


To conserve noble-metal thermoelements when exposure con- 
ditions do not demand them, it is desirable to use low-cost base- 
metal lead wires having the same emf output as the thermoele- 
ments over the temperature range of interest. Before matching 
empirically the platinum-15 per cent iridium/palladium combination,
a statistical approach was employed to predict the composition
of base-metal alloys giving a close match to platinum- 
13 per cent rhodium/platinum. It initially appeared that the 
most likely choice would be Nichrome coupled with a stainless 
steel of the AISI-300 series. Experimental emf data were then 
obtained at 500, 1000, and 1500 F on 48 combinations of three 
different Nichrome wires and sixteen different stainless-steel 
wires. The emf output for one type of Nichrome (n) and stainless-steel
(s) wire combination at one temperature (T) can be approximated by:

$$y(n/s,\, T) = b_0 + b_1 w_1 + b_2 w_2 + \cdots + b_{11} w_{11} + b_{12} w_{12} \tag{1}$$

where $w_1$ to $w_{11}$ are the analytic weight per cents of alloy elements
(C, P, Ni, Cr, Mn, S, Mo, Cb, Cu, Ti, and Si, respectively), $w_{12}$ is
included as an independent random variable, and the coefficients
$b_0$ to $b_{12}$ are determined by least-square analyses of experimental
data. Nine such regression equations were obtained; since the 
b’s pertaining to y’s of each different temperature level vary, it 
should be possible to design lead wires with a desired emf output 
by a proper choice of alloy elements. 
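
As a rough illustration of how coefficients of the kind appearing in equation (1) could be obtained, the following Python sketch fits a linear model by ordinary least squares with NumPy. The composition matrix and emf values are synthetic placeholders, not data from this study, and the function name `fit_emf_regression` is hypothetical.

```python
import numpy as np

def fit_emf_regression(weights, emf):
    """Least-squares fit of emf = b0 + sum_i b_i * w_i.

    weights: (n_samples, n_variables) array of weight per cents.
    emf:     (n_samples,) array of emf readings at one temperature.
    Returns the coefficient vector [b0, b1, ..., bn].
    """
    weights = np.asarray(weights, dtype=float)
    emf = np.asarray(emf, dtype=float)
    # Prepend a column of ones so that b0 is fitted as the intercept.
    design = np.column_stack([np.ones(len(weights)), weights])
    coeffs, *_ = np.linalg.lstsq(design, emf, rcond=None)
    return coeffs

# Synthetic example (assumed values): 16 alloys, 3 composition variables.
rng = np.random.default_rng(0)
w = rng.uniform(0.0, 20.0, size=(16, 3))
true_b = np.array([5.0, 0.30, -0.12, 0.05])
y = true_b[0] + w @ true_b[1:]
b = fit_emf_regression(w, y)
```

With one such fit per temperature level, the fitted coefficients could then be compared across temperatures in the manner the text describes.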

The closest calculated match was a nickel-base alloy containing 
19.7 per cent Cr, 0.5 per cent Mn, 0.3 per cent Fe, and 1.3 per 
cent Si paired with an iron-base alloy containing 12.4 per cent Ni 
and 18.7 per cent Cr with traces of C, P, Cb, and Ti. If considered 
individually, C, Mn, S, and Si appear to increase emf output 
while P, Ni, Cr, Mo, Cb, Cu, and Ti lower it. As the calculated 
result could not be relied on to give a perfect fit, melts were pre- 
pared using commercial foundry techniques to confirm the 
statistical analysis. Sand-cast ingot bars were rolled down to a 
diameter of 0.289 inch and annealed at 1830 F (1000 C) for 30 
minutes before testing. None of the bars measured had an emf 
deviation of less than ±5.0 per cent from the platinum-13 per cent 
rhodium/platinum thermocouple, indicating that alloy inter- 
actions can not be neglected and that greater experimental 
accuracy is needed for initial b values. 

On conducting a survey of the thermoelectric characteristics 
of commercial lead-wire systems, it was observed that Nichrome/ 
nickel-3 per cent silicon deviated by +2.5 per cent from the 
platinum-10 per cent iridium/palladium thermocouple. By 
increasing the degree of cold work in the nickel-3 per cent silicon 
alloy, maximum deviation was reduced to ±0.5 per cent, but 
this treatment gave the wire poor stability at high temperature. 
To match the higher output provided by platinum-15 per cent 
iridium/palladium, increasing amounts of silicon were added to 
the negative nickel-base alloy until close agreement was obtained 
with Nichrome/nickel-4.3 per cent silicon when the Nichrome 
was annealed at 1830 F for 30 minutes and the nickel-4.3 per cent 
silicon alloy was annealed at 1750 F (954 C) for one hour. Lead- 
wire deviation was determined by placing a lead-wire junction 
against a noble-metal junction in a furnace at known temperature;
the outputs of each were measured with a separate K-2 
potentiometer. The per cents of deviation were -0.47, -0.71, 
+0.56, +0.53, -0.39, and -1.40 at 263, 499, 780, 1010, 1316, 
and 1442 F (128, 260, 416, 543, 713, and 784 C), respectively. 
It should be remembered that a deviation of +0.53 per cent at 
1010 F represents 5 F, and a 5 F error to the whole system operat- 
ing at 2300 F is only an inaccuracy of +0.22 per cent. Acceler- 
ated stability tests were conducted by exposing lead wire at 1830 F 
for 24-hour intervals and measuring the change from the initial 
calibration; after 528 hours, the emf deviation was +0.42 
per cent. 

Fig. 3 Calculated resistance curves showing the effect of thermocouple 
design and dimensions 


Insulation 


With the standard-design thermocouple using two parallel 
thermoelements in an infinitely conducting sheath: 

$$R = \frac{\rho}{\pi l} \ln\left[\frac{2S}{D_1}\cdot\frac{D_2^2 - S^2}{D_2^2 + S^2}\right] \tag{2}$$

where $R$ is the resistance between thermoelements, $\rho$ is insulation 
resistivity, $l$ is probe length, $S$ is the distance between thermoelements,
and $D_1$ and $D_2$ are thermoelement and insulation outer 
diameters, respectively. With the concentric design: 

$$R = \frac{\rho}{2\pi l} \ln\frac{D_4}{D_3} \tag{3}$$

where $D_3$ is the diameter of the central thermoelement and $D_4$ 
is the inner diameter of the outer thermoelement. In both 
equations $\rho$ and $l$ are the main factors influencing resistance; 
owing to practical size limitations, the other thermocouple 
dimensions have relatively little effect. Since there is obviously 
a limit to which $l$ can be decreased, $\rho$ is a most effective variable. 
In addition to possessing high resistivity, the insulation must 
also offer thermoelement support, resist thermal shock and 
vibrational stress, cause no abrasion, and be chemically compati- 
ble with the thermoelements. A comparison of the two thermo- 
couple designs and dimensions, when $\rho$ equals 10* ohm-cm and $l$ 
equals three inches, is shown in Fig. 3. A concentric thermo- 
couple with an inner thermoelement diameter of 0.005 inch and a 
standard-design thermocouple with a thermoelement diameter 
of 0.02 inch, where $S = 0.486\,D_2$, have equivalent insulation 
resistance when the outer diameter of each is about 0.22 inch. 
These outer-diameter values do not include sheath dimensions 
or, in the concentric case, the extra insulation required to separate 
the sheath from the thermoelement. At greater outer diameters, 
the standard design is superior, but, if its thermoelement 
diameter is increased, this design is no longer better than the 
concentric. 
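
The comparison above can be checked numerically. The sketch below assumes equations (2) and (3) read R = (ρ/πl) ln[(2S/D1)(D2² − S²)/(D2² + S²)] and R = (ρ/2πl) ln(D4/D3), which is a reconstruction of the printed forms; the function names are illustrative only. Resistances are expressed in units of ρ/(πl), so no value of ρ or l is needed.

```python
import math

def parallel_pair_resistance(d_wire, d_outer, spacing):
    """Eq. (2) in units of rho/(pi*l): two wires in a conducting sheath."""
    shield_term = (d_outer**2 - spacing**2) / (d_outer**2 + spacing**2)
    return math.log(2.0 * spacing / d_wire * shield_term)

def concentric_resistance(d_inner, d_outer):
    """Eq. (3) in units of rho/(pi*l): central wire inside a tubular element."""
    return 0.5 * math.log(d_outer / d_inner)

def best_spacing_ratio(steps=100000):
    """Scan S/D2 in (0, 1) for the ratio that maximizes Eq. (2)."""
    def log_argument(x):
        # Part of the logarithm's argument that depends on x = S/D2.
        return x * (1.0 - x * x) / (1.0 + x * x)
    best_index = max(range(1, steps), key=lambda i: log_argument(i / steps))
    return best_index / steps

d_outer = 0.22  # inches, outer diameter quoted in the text
r_standard = parallel_pair_resistance(0.02, d_outer, 0.486 * d_outer)
r_concentric = concentric_resistance(0.005, d_outer)
print(round(best_spacing_ratio(), 3), round(r_standard, 3), round(r_concentric, 3))
```

Under these assumptions the scan places the most favorable spacing close to S = 0.486 D2, and the two designs give nearly the same resistance at an outer diameter of 0.22 inch.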
Fig. 4 Experimental and published electrical resistivities of various insulation
materials 


Periclase, beryllium oxide, magnesium oxide, and aluminum 
oxide were initially considered as insulation materials. Re- 
sistivity-versus-temperature curves were obtained on the latter 
two materials swaged into the standard parallel-wire design; 
the results of these tests are shown with the data of earlier 
investigators in Fig. 4. Swaged aluminum oxide was preferred, 
rather than swaged magnesium oxide, because of its higher 
resistivity. In its natural crystalline state, periclase is fabricated 
with difficulty; even though its mechanical strength is good, 
cyclic vibration causes early failure. Swaging crushed periclase 
destroys its high resistivity characteristics. Beryllium oxide is 
toxic and relatively expensive to fabricate. While not yet 
completely evaluated by extensive service testing, aluminum 
oxide appears to have satisfactory mechanical, physical, and 
chemical properties over the required temperature range. 


Sheath Material 


At the segmental entry to a turbine, a clear definition of 
temperature and velocity gradients at any time is impossible 
because of their dependence on flow conditions from the com- 
pressor. Pulsation, surging, or shock from the compressor 
delivery can radically alter the situation; under these circumstances,
rigorous mechanical requirements are difficult to specify. 
Use of a noble-metal sheath material would be highly impractical, 
and so consideration was given those base metals which are 
operational up to a maximum temperature of about 2100 F. 
If no intergranular oxidation is desired for high reliability, the 
most promising materials are Inconel-702, Hastelloy-X, and 
Inconel. Confirmation tests showed that serious intergranular 
attack does not appear, even after 96 hours at 2450 F (1343 C) in 
an oxidizing atmosphere, in Inconel-702 while it does in Hastelloy- 
X and Inconel. Various cermet and ceramic-coating combina- 
tions were combustion tested at 2400 F without sufficient success 
to warrant recommendation. The aggregates of metal and 
ceramic possessed no measurable ductility. When heating or 
cooling a metal protected with a ceramic coating, sharp tempera- 
ture gradients exist across the low conductivity ceramic. This 
difference in temperature coupled with a difference in expansion 
coefficient results in very high stresses, and the expanding metal 
stretches the coating until rupture occurs. 

Under normal operating conditions, the force of the combustion 
gases against the 0.25-inch outer diameter, 3-inch long probe will 
be more or less constant at a value no greater than 1.0 pound at 
2100 F; on designing the probe in the shape of an airfoil, this 
value can be decreased by an order of magnitude. Since the 
maximum fiber stress in a fixed-free cantilever of circular cross 
section is: 

$$\sigma = \frac{32 F l}{\pi d^3} \tag{4}$$

where $F$ is the applied load, $l$ is probe length, and $d$ is diameter, 
substitution of the aforementioned values shows that the sheath 
material may undergo a stress of about 1950 psi. Normal 
exposure at the fixed position would be 400 hours at about 1600 F; 
commercial stress-rupture and creep data indicate that the alloys 
considered would all adequately eliminate the problem of probe 
breakage. Owing to its superior oxidation resistance for tran- 
sients up to 2400 F, the final choice for this application was In- 
conel-702. 
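
The quoted stress can be reproduced with a short calculation. This sketch assumes equation (4) has the usual form for a circular cantilever loaded at its free end, σ = 32Fl/(πd³), and that the load is 1.0 pound on the 0.25-inch diameter, 3-inch long probe; the function name is illustrative.

```python
import math

def max_fiber_stress(load_lb, length_in, diameter_in):
    """Bending stress (psi) at the root of an end-loaded circular cantilever."""
    return 32.0 * load_lb * length_in / (math.pi * diameter_in**3)

stress = max_fiber_stress(1.0, 3.0, 0.25)
print(round(stress))
```

Treating the gas force as a point load at the tip is the more severe case; a load distributed along the probe would give a lower root stress.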


Time Response 


To decrease the difference between transient changes in true 
temperature ($t_g$) and indicated thermocouple temperature ($t_i$), it 
was desired for automatic controls that the system be designed 
to have a maximum time response of 1.2 seconds at a mass flow 
rate of six pounds per second per square foot. Considering only 
convective heat transfer to or from a machined-loop, open-end 
junction: 

$$\frac{t_i - t_0}{t_g - t_0} = 1 - \exp\left(-\frac{4 h \tau}{\delta D c}\right) \tag{5}$$

where $t_0$ is the initial temperature of both the gas and the thermocouple
tip before a step-temperature change occurs, $\tau$ is the time 
elapsed after the change, $\delta$ is the average density of both thermoelements,
$D$ is the average wire diameter, $c$ is the average specific 
heat, and $h$ is average heat-transfer coefficient. Characteristic 
time is identified as the time, in seconds, when $\tau = \delta D c / 4h$ and the 
right-hand member of equation (5) equals 0.632. The ratio 
$\tau_1/\tau_2 = \delta_1 c_1/\delta_2 c_2$ can be applied to two thermocouple systems of
different materials when their diameters and heat-transfer coefficients are 
equal. Since a Chromel/Alumel system has an average $\delta_2 c_2$ of 62 
Btu per deg R per cubic foot and noble-metal systems have an 
average $\delta_1 c_1$ of 44 Btu per deg R per cubic foot, the time response 
of platinum-15 per cent iridium/palladium should be faster than 
Chromel/Alumel by a factor of 1.41. Moffat [7] has shown that 
a 16-gage Chromel/Alumel loop junction has a characteristic 
time of about 1.3 seconds at 1600 F at a mass flow rate of six 
pounds per second per square foot. From the previous analysis, 
it was reasonable to expect, and later confirmed, that the charac- 
teristic time of a noble-metal system under the same conditions 
would be about 0.9 second. 
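
The scaling argument above can be followed in a few lines. The sketch assumes a first-order response of the form 1 − exp(−τ/τc) and scales the characteristic time by the ratio of volumetric heat capacities, with equal diameters and heat-transfer coefficients as stated in the text; the function names are illustrative.

```python
import math

def response_fraction(elapsed_s, characteristic_time_s):
    """Fraction of a step change indicated after elapsed_s (first-order lag)."""
    return 1.0 - math.exp(-elapsed_s / characteristic_time_s)

def scaled_characteristic_time(tau_ref_s, heat_capacity_ref, heat_capacity_new):
    """Scale a characteristic time by the ratio of volumetric heat capacities."""
    return tau_ref_s * heat_capacity_new / heat_capacity_ref

tau_chromel_alumel = 1.3   # seconds, 16-gage loop junction, from the text
speedup = 62.0 / 44.0      # Btu per deg R per cubic foot, from the text
tau_noble = scaled_characteristic_time(tau_chromel_alumel, 62.0, 44.0)
fraction_at_tau = response_fraction(tau_noble, tau_noble)
print(round(speedup, 2), round(tau_noble, 2), round(fraction_at_tau, 3))
```

The three printed values correspond to the factor of 1.41, the expected characteristic time of about 0.9 second, and the 63.2 per cent point used to define characteristic time.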

The time response of various Chromel/Alumel thermocouple- 
junction configurations was measured with a specially constructed 
test facility where the thermocouple is first heated in a tubular 
electric furnace, and then it is quickly lowered into a cool, moving 
air stream. An orifice plate used in conjunction with a U-tube 
manometer measured mass flow rates. The thermoelectric emf 
of the thermocouple is fed into a high-speed recording oscillo- 
graph; decade resistance boxes are used for the desired im- 
pedance matching and recorder sensitivity. Time response is 
read directly, with an accuracy of 2 per cent, as the time required 
to reach 63.2 per cent of the recorded temperature change. For 
a temperature interval of 1000 F and a mass flow rate of six 
pounds per second per square foot, the time responses of 18-gage 
Chromel/Alumel machined-loop, beaded, fused-splice, and concentric
junctions were 1.17, 1.92, 2.59, and 0.71 seconds, respectively.
Sample thermocouples, made from 16-gage (0.051-inch 
diameter) platinum-15 per cent iridium/palladium thermoelements
and Nichrome/nickel-4.3 per cent silicon lead wire, were 
sent to the National Bureau of Standards for confirmation of 
response rates. The junctions were butt welded and formed into 
machined loops protruding 0.313 to 0.438 inch beyond the ceramic
insulation. The data listed in Table 3 show that response
rate requirements were met. 


Table 3 Time response results 

Thermocouple no.; characteristic time at 1600 F, seconds, at mass flow
rates of 2, 4 pounds per second per square foot 

1.01   1.06   1.08   1.15   1.08   1.01 


Connectors 


A reliable thermocouple connector should have low contact re- 
sistance, not cause a high spurious-voltage error, and provide a 
reasonably long mechanical life. An evaluation test for this com- 
ponent consisted of cyclic vibration at 45 g’s, 190 cps, and 25-mil 
vertical displacement on heating the connector assembly to 1500 
F. Voltage was applied across the contacts, and failures from in- 
termittent contact, high resistance, or an open circuit were 
monitored on an oscilloscope. Under these conditions, a standard 
production connector, which has operated satisfactorily for over 
150 hours in the field, would fail within about 2.5 hours’ test time 
at 1000 F. Because of inherent errors and servicing difficulties, 
the stud and nut type of connector was discarded early; another 
connector which utilizes a tapered thread to obtain contact pres- 
sure failed because of lead-wire breakage and loose interconnecting 
nuts. Two- and four-contact connectors were subsequently 
designed which gave performance far in excess of the 400-hour 
life required. 
Harness Design 


The purpose of a harness is to permit the electrical joining of 
more than one thermocouple and to present an emf to the end 
device which is an average of the thermocouple pattern through- 
out the engine cross section. Common-junction, ladder, compen- 
sated-ladder, equal-resistance, and geometrically balanced har- 
nesses are the five basic types used for averaging. The common- 
junction design is simple and accurate, but it involves balancing 
wire lengths to yield equal resistance; multiple leads running to 
the junction point result in extra weight. With the ladder 
harness, individual thermocouples are connected between two 
common leads; unless the common leads are much larger in 
diameter than the branch wires, the thermocouples most removed
from the end device would contribute the least output and 
introduce averaging errors. The compensated-ladder harness is 
adjusted to give a circuit which balances the effective contribu- 
tion of each thermocouple to the output. In an equal-resistance 
harness, the leads are sized so that resistance effects are the same 
for all junctions, but a change of ambient temperature from the 
design value will cause a large upset in the balance of the thermo- 
couple circuit due to differences in temperature coefficient of re- 
sistance of the leads. The geometrically balanced system is es- 
sentially a series of common-junction harnesses combined into an 
entire system; in the normal form, pairs of thermocouples are 
joined and then sets of pairs are joined. However, temperature 
variations between junctions or in the harness will introduce small 
errors, and loss of a branch will create a large averaging error. 
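
To illustrate why lead resistance matters for averaging, the sketch below treats each thermocouple branch as an emf in series with its loop resistance, with all branches feeding a high-impedance end device in parallel. Under that assumption, which the text does not state in this form, the indicated emf is the conductance-weighted mean of the branch emfs. The emf and resistance values are made up for illustration.

```python
def parallel_average_emf(emfs_mv, resistances_ohm):
    """Open-circuit emf of parallel branches, each an emf in series with a resistance."""
    conductances = [1.0 / r for r in resistances_ohm]
    weighted = sum(e * g for e, g in zip(emfs_mv, conductances))
    return weighted / sum(conductances)

emfs = [30.0, 32.0, 34.0]  # millivolts, assumed branch outputs
balanced = parallel_average_emf(emfs, [2.0, 2.0, 2.0])
unbalanced = parallel_average_emf(emfs, [1.0, 2.0, 4.0])
print(balanced, round(unbalanced, 3))
```

With equal branch resistances the result is the arithmetic mean; with unequal resistances the low-resistance branch dominates, which is the averaging error the ladder and equal-resistance designs have to manage.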

Since an available engine application required 18 dual-junction 
probes or a total of 36 thermocouples, the design selected utilized 
six interchangeable elements of three probes each (Fig. 5); the 
three branch circuits are balanced by the use of a compensating 
resistor to give an average output. The elements are intercon- 
nected by a harness which forms a six-element compensated- 
ladder circuit. This arrangement facilitates engine installation 
without introducing special problems; the harness is to be sup- 
ported from the engine so that junction-box vibration does not 
unduly load the thermocouple structure. By reducing the num- 
ber of connectors from 18 to 6, the probability of failure is cor- 
respondingly reduced, and the small number of connections results 
in greater care being exerted during installation and replacement 
operations. 
Conclusions 


1 From the preceding systematic concert of effort, certain 
materials, designs, and dimensions were developed for each com- 
ponent of a turbine-inlet thermocouple system. 

2 Platinum-15 per cent iridium/palladium thermoelements 
produce a thermoelectric emf of 33 millivolts at 2000 F, and 
their emf deviation does not exceed ±0.35 per cent during 400- 
hour exposure at 2300 F in a combustion atmosphere. 

3 A maximum time response of one second at a mass flow rate 
of six pounds per second per square foot is provided by 16-gage 
inert-arc, butt-welded junctions of a standard-design noble-metal 
combination. 

4 A matching nickel-4.3 per cent silicon lead-wire alloy of the 
same gage was developed using standard foundry and fabrica- 
tion techniques. 

5 Swaged aluminum-oxide insulation is preferred because of 
its high resistivity and satisfactory mechanical properties. 

6 The high oxidation resistance, strength, and ductility of In- 
conel-702 make it a suitable sheath material. 

7 An experimental harness consisting of six four-contact con- 
nectors and 18 dual-junction thermocouples distributed in a six- 
element compensated-ladder circuit was designed for an available 
application. 
Estimating the Roots of the Characteristic 
Determinant for Multicoupled Systems 


A procedure is developed, based on Cauchy's residue theorem, for bounding the un- 
damped natural frequency, damping ratio, and real part of the roots of the characteristic 
determinant associated with a multicoupled system with several inputs and outputs. 
The method can lead to a locus of the roots as one or more parameters are varied. The 
underlying theory is developed and a numerical illustrative example is included. 


Introduction 


PHYSICAL SYSTEMS with several interacting inputs 
and outputs are frequently encountered in engineering practice. 
Aircraft, for example, are associated with at least six degrees of 
freedom, requiring six outputs and, generally, even more inputs. 
The design of such systems, particularly controlled-system design, 
often requires a study of the possible dynamic characteristics of 
the system as the parameters are varied. This paper relates 
dynamic performance with variations of component values, and 
compares the anticipated performance of one configuration with 
those associated with other possible configurations. One of the 
objectives is to determine the most suitable configuration and the 
component values that provide for the best performance. The 
criteria for the choice of configuration and component values de- 
pend upon given requirements and specifications peculiar to the 
application and are outside the scope of the present discussion. 
Dynamic-performance studies are often laborious and extended, 
even when the study is restricted to a single configuration. The 
effort becomes significantly greater when it is necessary to com- 
pare the performance of several possible configurations. This is 
especially true for multicoupled systems with several inputs and 
outputs. It is advantageous to be able to establish bounds upon 
the range of variation of the component values as they affect 
dynamic characteristics, before entering upon a more intensive 
investigation. In some instances, a knowledge of the bounds 
may suffice. 

In a previous paper [2] a method was developed from Cauchy’s 
residue theorem for determining limiting values of the dynamic 
characteristics of physical systems having a single input and a 
single output. It is assumed that the system under present study 
may be associated with a set of linear integro-differential equa- 
tions with constant coefficients. Properties of the system com- 
ponents are represented by the equation coefficients, usually as 
functional relationships of the various elements. The purpose of 
this design study is to determine the useful range of variation of 
the coefficients. As is well-known, the solution of a set of linear 
differential equations may be associated with the roots of the 
characteristic determinant. The dynamic problem may be related
to the determination of the characteristic determinant roots 
as the equation coefficients vary. 


Multicoupled System Mathematical Background 


The mathematical background for multicoupled systems is de- 
veloped in Summary 1. A schematic form for the class of such 
systems is shown in the figure of the summary. Alternate forms 
of the set of linear simultaneous differential equations associated 
with multicoupled systems are given by Equations (1) and (3). 
Note that the input quantities, as well as the output quantities, 
are modified by differential operators. It is important in deter- 
mining the detailed response to any particular input to take into 
account the modifications that the system imposes on the input. 
For controlled systems, the roots of the right-hand differential 
operators are sometimes called the “closed-chain zeros.” However,
the modes of the system are fixed by the roots of the charac- 
teristic determinant, Equation (4), associated with the linear dif- 
ferential equation set. The input differential operators affect the 
intensity of the various modes in co-operation with the boundary 
values. In this paper, attention is concentrated on the roots of 
the characteristic determinant. The method described in this 
paper can be extended to take into account the input-operator 
forms if it is considered necessary. 

Let all of the coefficients except one in the characteristic determinant
be fixed and let it be required to determine the influence 
of the variable coefficient upon the characteristic determinant roots.
Summary 1 shows how, in general, the characteristic 
determinant can be altered in form to that shown by Equation (9) 
of the summary. The characteristic determinant divided by the 
minor of the operator containing the variable coefficient is shown 
to be equal to the variable coefficient plus expressions that are in- 
dependent of the coefficient. Since the characteristic determinant 
as well as the minor are polynomials with constant coefficients, 
their ratio is a rational function. The poles of the rational func- 
tion are roots of the minor and, presumably, are known. Hence, 
by applying Cauchy’s residue theorem for a rational function, the 
zeros of the expression in Equation (9) can be determined for par- 
ticular contours on the variable complex plane. The zeros are the 
roots of the characteristic determinant. 


Cauchy’s Contours 


The form of Cauchy’s residue theorem for application to the 
characteristic determinant is given in Summary 2. It shows that 
the difference between the number of zeros and the number of 
poles in a closed contour of the root complex plane is equal to the 
number of encirclements of the origin of the function plane en- 
compassed by the map of the contour. Since the equation set 
Summary 1 Characteristic determinant forms 


Outputs 


Gin(a) Gout(e) 
Multicoupled 
Operating System 


Schematic form of a multicoupled operating system. 


c= i=l d=] = ) 


p = d/dt 


Equation (1) represents a set of s simultaneous linear integro-differential equations with r inputs and s outputs. 
Use the operator symbols: 


k 
The equations become 
s r 

(0,26) [0.4.5] Gin(d) (3) 

c=] d=] 
The characteristic determinant associated with the set of equations is 

$$\Delta = \left|\,[O_{(ic)}]\,\right| \tag{4}$$

Let there be a particular coefficient, $a$, in a particular operator $[O_{(\alpha\beta)}]$. It is required to determine the influence 
of the particular coefficient, $a$, upon the characteristic determinant roots. 
The determinant, $\Delta$, can be expanded by minors 

$$\Delta = [O_{(\alpha\beta)}]\,\Delta_{(\alpha\beta)} + [R_{(\alpha\beta)}] \tag{5}$$

where 

$$\Delta_{(\alpha\beta)} = \text{minor of } \Delta \text{ associated with term } [O_{(\alpha\beta)}] \tag{6}$$

and 

$$[R_{(\alpha\beta)}] = \Delta - [O_{(\alpha\beta)}]\,\Delta_{(\alpha\beta)} = \text{remainder of the characteristic determinant} \tag{7}$$

Then 

$$\frac{\Delta}{\Delta_{(\alpha\beta)}} = [O_{(\alpha\beta)}] + \frac{[R_{(\alpha\beta)}]}{\Delta_{(\alpha\beta)}} \tag{8}$$

Since $\Delta$ and $\Delta_{(\alpha\beta)}$ are polynomials they have zeros only. The poles of $\Delta/\Delta_{(\alpha\beta)}$ are due to the zeros of $\Delta_{(\alpha\beta)}$, 
while the zeros of the function are the zeros of $\Delta$ alone. Hence the location of the roots of the characteristic 
determinant, $\Delta$, can be found by examining maps of the Cauchy contours for the right hand side of Eq. (8). Inasmuch
as the coefficient to be varied occurs in $[O_{(\alpha\beta)}]$, the map of the Cauchy contour for $[R_{(\alpha\beta)}]/\Delta_{(\alpha\beta)}$ 
will remain invariant for all values of the coefficient, $a$. The same is true for all the terms in $[O_{(\alpha\beta)}]$ not modified 
by $a$. If, for example, $a$ is the constant term in $[O_{(\alpha\beta)}]$, then 

$$\frac{\Delta}{\Delta_{(\alpha\beta)}} = a + \left\{[O_{(\alpha\beta)}] - a\right\} + \frac{[R_{(\alpha\beta)}]}{\Delta_{(\alpha\beta)}} \tag{9}$$

Similar expressions can be derived when $a$ is the coefficient of a nonconstant term of $[O_{(\alpha\beta)}]$. 

This leads to the procedure for determining the influence of the particular coefficient, $a$. 
Summary 2 Cauchy's residue theorem for rational functions 


From Cauchy’s residue theorem,* if $F(z)$ is a rational
function and $F'(z)$ is the derivative of the function 
with respect to $z$, and $C$ is a closed contour on the $z$ 
complex plane containing $Z$ zeros and $P$ poles of 
$F(z)$, then for positive encirclement of $C$, 

$$\oint_C \frac{F'(z)}{F(z)}\,dz = 2\pi j\,(Z - P) = 2\pi j N, \qquad N = Z - P$$

The integral may be interpreted to mean that the 
argument of $F(z)$ increases exactly $2\pi N$ radians as 
the closed contour $C$ is traced out in a positive sense, 
or that the F-plane path encircles its origin $N\,(= Z - P)$ 
times. This is the generalized mapping theorem. 

a) Cauchy’s residue theorem for rational functions 

b) Illustration of Cauchy's residue theorem for 
polynomials: a closed contour C and its F-plane map (two encirclements
of the origin). Since two zeros of F(z) are in the closed contour,
two encirclements of the F-plane origin occur. 


* See Reference (3) pp. 324 and 409. 


a) Real root representation: characteristic determinant root complex plane
(axis of reals, axis of imaginaries); reciprocal of root equals negative
characteristic time 


usually has only real coefficients for physical systems, the map of 
the contour is symmetrical with respect to the real axis. Again, 
since the equation coefficients are real, the roots of the character- 
istic equation are either real numbers or complex conjugate pairs. 
Fig. 1 shows that real roots appear as points on the real axis of the 
root complex plane, while complex numbers appear with a radius 
proportional to the undamped natural frequency and an angle 
whose negative cosine is the damping ratio. By determining the 
number of characteristic determinant roots within a given circle, 
the bounds of the undamped natural frequency of the roots can be 
found. Similarly, by determining the number of roots within a 
given sector, the bounds on the damping ratios can be estab- 
lished. These bounds are illustrated by Fig. 2. 
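
The geometric reading of a root can be written out directly. The sketch below follows the convention stated in the text, where the radius gives the undamped natural frequency and the negative cosine of the angle gives the damping ratio; the function name and the sample root are illustrative.

```python
import cmath
import math

def root_characteristics(root):
    """Return (undamped natural frequency, damping ratio) for a complex root."""
    omega_n = abs(root)                    # radius of the root on the complex plane
    zeta = -math.cos(cmath.phase(root))    # negative cosine of the root angle
    return omega_n, zeta

omega_n, zeta = root_characteristics(complex(-3.0, 4.0))
sector_zeta = -math.cos(math.radians(135.0))
print(round(omega_n, 3), round(zeta, 3), round(sector_zeta, 3))
```

A root inside a circle of given radius therefore has an undamped natural frequency below that radius, and a root on the ray at 135 degrees has a damping ratio of about 0.7.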

Cauchy’s residue theorem requires closed contours. Appro- 
priate contours incorporating the various bounds shown in Fig. 2 
are given by Fig. 3. The closure, as seen in Figs. 3(a) and 3(c), is 
accomplished by choosing a portion of a circle with large enough 
radius to enclose all the possible roots. It is sometimes useful to 
combine portions of two standard Cauchy contours in order to 
fix two properties simultaneously. For example, a sector contour 
may be closed by a portion of a circle with known radius to de- 
termine the number of roots that have a damping ratio equal to or 
greater than a specified damping ratio, and a natural frequency 
equal to or less than a specified undamped natural frequency. 

The actual procedure for determining the bounds for the char- 
acteristic determinant roots is quite elementary. A suitable 
Cauchy contour is chosen and sufficient points are computed for 
b) Complex root representation: radius vector of root equals undamped natural
frequency; cosine of radius vector angle (ADR, angle of damping ratio) equals
damping ratio 


Fig. 1 Representation of characteristic determinant roots on complex 
plane 


the invariant part of Equation (9), Summary 1, to indicate the 
curve. Accuracy is required only when the map crosses the real 
axis. The encirclements for various values of the variable co- 
efficient are determined by translating the origin along the real 
axis. The method is similar to the one used for Nyquist’s criterion,
since the latter is a special case of the procedure given 
here. 
a) Relative transient stability: for given relative transient stability all roots
must be to the left of the line fixed by the reference characteristic time, (CT)(ref). 

b) Undamped natural frequency: for all second-order modes to have undamped natural
frequencies greater than the reference value, the roots must be outside of the circle
whose radius is the reference undamped natural frequency. 

c) Damping ratio: for all second-order modes to have damping ratios greater than the
reference value, all characteristic determinant roots must be inside the sector whose
half-angle is fixed by the reference angle of damping ratio, (ADR)(ref). 


Fig. 2 Interpretation of characteristic determinant root complex plane loci 


Characteristic determinant 
root complex plane 


b) Standard Cauchy contour for estimating 
undamped natural frequencies 


a) Standard Cauchy contour for estimating 
relative transient stability 


Adiret) 


c) Standard Cauchy contours for estimating 
damping ratios 


Fig. 3 Standard Cauchy contours for estimating dynamic characteristics 
Numerical example

Consider a system of two equations with two outputs and one input as
follows:

$$[O_{11}]\,q_1 + [O_{12}]\,q_2 = b_1\,q_{in} \tag{10}$$

$$[O_{21}]\,q_1 + [O_{22}]\,q_2 = b_2\,q_{in} \tag{11}$$

where

$$[O_{11}] = p^3 + p^2 + p + a \tag{12}$$

$$[O_{12}] = p^2 + 1 \tag{13}$$

$$[O_{21}] = p + 4 \tag{14}$$

$$[O_{22}] = p + 1.1 \tag{15}$$

Characteristic determinant

$$\Delta = \begin{vmatrix} [O_{11}] & [O_{12}] \\ [O_{21}] & [O_{22}] \end{vmatrix} = [O_{11}][O_{22}] - [O_{12}][O_{21}] \tag{16}$$

and

$$\frac{\Delta}{[O_{22}]} = [O_{11}] - \frac{[O_{12}][O_{21}]}{[O_{22}]} \tag{17}$$

Note: $\Delta/[O_{22}]$ has a single pole where $p = -1.1$ (18)

When $p = Re^{jA}$, then

$$\frac{\Delta}{[O_{22}]} = \left\{R^3e^{j3A} + R^2e^{j2A} + Re^{jA}\right\} + a - \frac{(R^2e^{j2A} + 1)(Re^{jA} + 4)}{Re^{jA} + 1.1} \tag{19}$$

Six Cauchy contours are chosen, three sector contours with angles
A = 45°, 90°, 135° (corresponding to ζ = −0.7, 0, and 0.7
respectively) and three circle contours with radii R = 0.1, 1.0, 10.0.


Function plane maps for the six Cauchy contours (sketches not
reproduced). For each contour, N is the number of encirclements, P the
number of enclosed poles, and Z = N + P the number of enclosed roots:

Contour               Range of a           N    P    Z
A = 45° (ζ = −0.7)    3.65 < a             0    0    0
                      a < 3.65             1    0    1
A = 90° (ζ = 0)       3.65 < a             2    0    2
                      1 < a < 3.65         3    0    3
                      a < 1                1    0    1
A = 135° (ζ = 0.7)    3.65 < a             2    0    2
                      a < 3.65             3    0    3
R = 0.1               4.05 < a             0    0    0
                      3.34 < a < 4.05      1    0    1
                      a < 3.34             0    0    0
R = 1.0               59 < a               0    0    0
                      2.77 < a < 59        1    0    1
                      1.76 < a < 2.77      3    0    3
                      1 < a < 1.76         2    0    2
                      a < 1                0    0    0
R = 10.0              990 < a              0    1    1
                      978 < a < 990        1    1    2
                      −860 < a < 978       3    1    4
                      −984 < a < −860      2    1    3
                      a < −984             0    1    1


Summarizing, for Cauchy sector contours:

A = 45°, ζ = −0.7     1 root for a < 3.65
                      0 roots for 3.65 < a

A = 90°, ζ = 0        1 root for a < 1
                      3 roots for 1 < a < 3.65
                      2 roots for 3.65 < a

A = 135°, ζ = 0.7     3 roots for a < 3.65
                      2 roots for 3.65 < a

and for Cauchy circle contours:

R = 0.1               0 roots for a < 3.34
                      1 root for 3.34 < a < 4.05
                      0 roots for 4.05 < a

R = 1.0               0 roots for a < 1
                      2 roots for 1 < a < 1.76
                      3 roots for 1.76 < a < 2.77
                      1 root for 2.77 < a < 59
                      0 roots for 59 < a

R = 10.0 (the pole at −1.1 is enclosed, so P = 1 and Z = N + 1)
                      1 root for a < −984 (N = 0)
                      3 roots for −984 < a < −860 (N = 2)
                      4 roots for −860 < a < 978 (N = 3)
                      2 roots for 978 < a < 990 (N = 1)
                      1 root for 990 < a (N = 0)


From the summary it is concluded:

(1) There is one real positive root for all values of a less than
3.65. The magnitude of the positive real root becomes larger as a
decreases and becomes negative. The sign of this root becomes negative
when a exceeds 3.65.

(2) There is a pair of complex roots for all values of a. The damping
ratio is less than 0.7 and greater than −0.7. The real part of the
roots is negative for a < 1 and positive for a > 1, as the 90° sector
counts show (one enclosed root for a < 1, three for 1 < a < 3.65).

(3) One root is either negative real, or one of a pair of roots with
negative real parts. The magnitude of this root is always greater than
unity and less than 10.

(4) Only the real positive root can be smaller than 0.1 in magnitude.

(5) The locus of the roots is inferred to be that in the following
figure.


Journal of Basic Engineering 


march 1960 / 93 


Numerical example (concluded): function plane map for the Cauchy
contour R = 10.0 (P = 1, Z = N + P) and inferred locus of roots
(sketches not reproduced). Recoverable annotations on the locus:
circles R = 1.0 and R = 10.0; the values a = −984 and a = 978; a pair
of complex roots, with the direction of travel marked for decreasing
a; one real root, with the direction of travel marked for increasing a
(positive for a < 3.65); and a second real root, always negative, with
magnitude greater than 1 and less than 10.
Numerical Example 


In order to demonstrate the method, a numerical example is 
included. The example is a trivial one, selected only for illustra- 
tive purposes. The system is represented by two equations with 
one input and two outputs. The equations are shown with opera- 
tional coefficients by Equations (10) and (11) of the example. 
Specific values for the operators are given by Equations (12) 
through (15). In Equation (12) the constant term in the third- 
order operator is adjustable. The purpose of the numerical analy- 
sis is to show how bounds on the roots of the characteristic de- 
terminant set bounds on the values of this constant term. 

The characteristic determinant associated with the equation set
is a fourth-order polynomial as indicated by Equation (16). The
modified form of the characteristic determinant is shown as Equation
(17). A variation of the modified form is represented by
Equation (19), substituting the polar form of the complex variable
for the operator p. There is a single pole for the rational function
at the point (−1.1). Note that it is unnecessary to expand
the characteristic determinant any further in order to use
Cauchy's theorem.
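
As a rough numerical counterpart to Equations (17) through (19), the sketch below counts the encirclements N of the origin by the map of a circle contour and adds the enclosed poles P to obtain Z = N + P. The function names, the sampling density, and the use of NumPy are assumptions made for illustration; the paper itself works graphically from a few computed points. The part of the map that does not depend on a is evaluated on its own, and a is then added, which corresponds to translating the origin along the real axis.

```python
import numpy as np

POLE = -1.1  # single pole of the modified determinant, Equation (18)

def invariant_part(p):
    """Part of Delta/[O22] that does not depend on the adjustable constant a."""
    return p**3 + p**2 + p - (p**2 + 1) * (p + 4) / (p - POLE)

def roots_inside_circle(radius, a, samples=20001):
    """Estimate Z = N + P for the circle contour |p| = radius."""
    theta = np.linspace(0.0, 2.0 * np.pi, samples)
    p = radius * np.exp(1j * theta)
    # Adding a shifts the whole map along the real axis.
    phase = np.unwrap(np.angle(invariant_part(p) + a))
    # Net counterclockwise encirclements of the origin by the map.
    n = int(round((phase[-1] - phase[0]) / (2.0 * np.pi)))
    p_enclosed = 1 if radius > abs(POLE) else 0
    return n + p_enclosed
```

For instance, `roots_inside_circle(1.0, 2.0)` can be compared with the entry for 1.76 < a < 2.77 in the summary of the example. The estimate is unreliable when a root lies on or very near the contour.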

Six representative Cauchy contours have been selected to
bound the roots. Three of the contours are selected to establish
bounds on the damping ratio and three on magnitude. The
damping ratios are taken as −0.7, 0, and +0.7, corresponding to
the damping ratio angles of 45, 90, and 135 deg. The circle contours
are taken to correspond to radii of 0.1, 1.0, and 10.0.
Sketches of the Cauchy contours and the respective function
plane maps are included in the example in parallel columns. The
number of roots for each contour is related to bounding values of
the adjustable constant a. For example, when the contour is the
sector with half-angle equal to 45 deg, there are no enclosed roots
for a greater than 3.65 and only one root for a less than 3.65. It is
inferred that there is a real positive root when the adjustable
constant is less than 3.65 and the value of this root is zero when a
is equal to 3.65. This inference is confirmed by the other two
sector studies.

When the sector half-angle is equal to 90 deg, there are two
roots for a greater than 3.65. Since there are no roots for this
value of the adjustable constant when the sector half-angle is 45
deg, it follows that there is a pair of roots between the radii of
angle ±45 and ±90 deg, and this pair must be complex conjugate.
The number of roots for the second sector increases to
three when a becomes less than 3.65, indicating the presence of a
positive real root. The number of roots decreases to one when
the value of a becomes less than unity. This implies that the
complex pair of roots crosses the imaginary axis for a equal to
one.

By setting the sector half-angle equal to 135 deg, the next
figure shows that the complex roots have an associated damping
ratio that is never greater than 0.7. The three sector studies
suggest the existence of a real positive root for a less than 3.65
and, for all values of a, a pair of complex conjugate roots lying
between ±45 and ±135 deg.
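
The sector counts quoted above can be cross-checked by solving the characteristic determinant directly. The following sketch assumes NumPy and uses hypothetical helper names; it expands Equation (16) from the operators in Equations (12) through (15) and counts the roots whose radius vectors fall inside a sector about the positive real axis. It is a check on the example, not the graphical procedure the paper describes.

```python
import numpy as np

def characteristic_roots(a):
    """Roots of Delta = [O11][O22] - [O12][O21] for a given constant a."""
    o11 = np.poly1d([1.0, 1.0, 1.0, a])   # p^3 + p^2 + p + a
    o12 = np.poly1d([1.0, 0.0, 1.0])      # p^2 + 1
    o21 = np.poly1d([1.0, 4.0])           # p + 4
    o22 = np.poly1d([1.0, 1.1])           # p + 1.1
    return (o11 * o22 - o12 * o21).roots

def roots_in_sector(a, half_angle_deg):
    """Count roots within +/- half_angle_deg of the positive real axis."""
    angles = np.degrees(np.abs(np.angle(characteristic_roots(a))))
    return int(np.sum(angles < half_angle_deg))

# Sector counts for one value of a in each range of the example.
for a in (5.0, 2.0, 0.0):
    print(a, [roots_in_sector(a, half_angle) for half_angle in (45, 90, 135)])
```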

The next three contours relate to the magnitudes of the roots.
The interpretation follows as for the sector contours. It is to be
noted that, when the radius of the circle is equal to 10, it is
necessary to account for the pole at (−1.1). The studies also indicate
the presence of a negative real root with magnitude greater than
unity but less than 10 for all values of a.

It is interesting to observe the amount of detail that may be in- 
ferred from relatively simple computations. The results are 
summarized at the end of the numerical example. A possible 
locus of roots as a function of the adjustable coefficient is given 
at the end of the example. The known values of the adjustable 
coefficient are indicated. For many applications the sketch of 
the locus is quite adequate to guide further work. 


Conclusions 


An elementary procedure has been described and illustrated for 
estimating the roots of characteristic determinants associated 
with multicoupled systems. The procedure can be employed to 
determine the influence of variations of the system parameters on 
the roots. The method can be employed using hand calculations,
but it is easily adapted to machine computations, particularly for
digital computers. The computation of the necessary points can
be readily programmed to allow more time for interpreting the
results.


Acknowledgment 


The authors wish to thank Prof. Walter Wrigley and the
Training Group of the Instrumentation Laboratory, Massachusetts
Institute of Technology, for the opportunity to carry
out the investigation, and the Jackson & Moreland Technical
Publications Group for assistance in preparing the manuscript
and illustrations.


References

1 E. M. Goldberg and H. D. Felsenthal, Jr., "Estimating the
Dynamic Characteristics of a Multicoupled Physical System," Report
T-138, Instrumentation Laboratory, Massachusetts Institute of
Technology, Cambridge, Mass., May, 1957.

2 S. Lees, "A Method of Estimating Dynamic Characteristics of
Physical Systems," "Frequency Response," edited by Rufus Oldenburger,
The Macmillan Company, New York, N. Y., 1956, pp. 201-207.

3 E. A. Guillemin, "The Mathematics of Circuit Analysis," Principles
of Electrical Engineering Series, Technology Press, Massachusetts
Institute of Technology, Cambridge, Mass., and John Wiley & Sons,
Inc., New York, N. Y., 1949.

A. IBERALL 


Chief Physicist, Rand Development 
Corporation, Cleveland, Ohio 
Mem. ASME 


Human Body as an Inconstant Heat Source 
and Its Relation to Determination of 
Clothes Insulation 


Part 1—Descriptive Models of the Heat Source’ 


A precise characterization of the thermal resistance of clothes requires an accurate
description of the static and dynamic thermal characteristics of the human heat source.
Experimental measurements on the human have revealed a frequency spectrum of
sustained thermal power oscillations that mask theoretical long-time equilibrium
adjustments. This points to the number of degrees of freedom that must be involved in the
thermoregulation of the human, and the specific nonlinear characteristics of the system.
Therefore at best, a resistance model for clothes is possible only as an ohmic relation
among time-averaged equilibrium values, and for a specific mode of operation of the
system. The validity of this hypothesis, however, has not been proved.


Introduction


A client of the Rand Development Corporation requested
development of a method for assessing, among a variety
of clothes of interest to him, the differences in their ability to
provide protection against extremes of temperature. Because of
some rather novel and controversial elements in the method of
providing protection in these clothes, the client requested that
the method be absolutely reliable and highly accurate.

A review of the literature revealed the following methodology
for describing and determining the "insulation" value of clothes.
Clothes are described as an equivalent "conductance." For example,
Gagge, Burton, and Bazett [1]² defined the "clo," a unit
of thermal insulation of clothing, as the amount of insulation
necessary to maintain comfort (a mean skin temperature of 92 F)
of a resting subject in a room at 70 F with air movement of 20 fpm,
humidity not over 50 per cent, with a metabolism of 50 kilocalories
(kcal) per sq meter per hr. Equivalently, they have
taken this to mean that, on the assumption that 76 per cent of the
heat is lost through clothing, a clo may be defined in physical
terms as the amount of insulation that will allow the passage of
1 kcal per sq meter per hr with a temperature gradient of 0.18
deg C between the two surfaces of clothing. Much of the ideology
and experimental science that has gone into developing these
concepts may be found in an authoritative book by L. H. Newburgh
[2]; a book by Winslow and Herrington [3]; and in a published
symposium on temperature [4].

Whereas, as a good first approximation, there may appear to
be little to criticize in the "equivalent conductance" concept of
clothing, for precise work a very carefully constructed scientific
foundation in theory must be laid to justify such a concept.


¹ This is the first of three papers that describe a methodology for
measuring clothes insulation.

² Numbers in brackets designate References at end of paper.

Contributed by the Instruments and Regulators Division and presented
at the Instruments and Regulators Conference, Cleveland,
Ohio, March 29-April 2, 1959, of The American Society of
Mechanical Engineers.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
22, 1959. Paper No. 59-IRD-7.


The specific difficulties are to determine, not how to make a
laboratory test of the equivalent thermal resistance of a sample
of clothes, but how the body determines its surface thermal
boundary conditions, so that its characteristics as a potential
source are understood. In particular, since the body is a
self-actuated thermoregulated system, what is needed is a dynamic
description of its operation. Past descriptions of the effect of
clothes are weak precisely in their lack of pointed emphasis on
the dynamics of the system.

Related difficulties existed in the science of color which were
not resolved until distinctions were made between the psychology
of color (how the eye characterizes color); the psychophysics of
color (by what physical laws the eye responds to or
"adds" color); and the physics of color (the physical laws that
describe color).

It is the purpose of this preliminary study to lay at least a 
phenomenological basis for a valid description of the thermoregu- 
lation of the human that will permit a precise characterization of 
the effect of clothes. 


Descriptive Models of the Human Heat Source as a 
Dynamic Thermoregulated System 
The chemically reactive thermal field that the body represents
has been described by the following power balance:

$$M = E + W + K(T_s - T_a) + S$$

where

$M$ = rate at which energy is produced by the heat of
combustion (i.e., metabolism)
$E$ = rate at which energy is lost through evaporative
mass transfer (i.e., evaporation)
$W$ = rate at which external work is done
$S$ = rate at which heat is stored in the system (i.e.,
storage)
$K(T_s - T_a)$ = rate at which heat is conducted through an
insulating sheath in contact with the body
$K$ = "equivalent" conductance of insulation surrounding
the body
$T_s$ = "effective" skin temperature
$T_a$ = "effective" surrounding temperature

In this description clothes are characterized by their effect in
determining the conductance $K$.

The physiological description of the thermoregulation of the 
human, given in the literature, describes three regions of regula- 
tion (illustratively, for a nude subject doing no external work). 
In the expressions that follow, zero subscripts are used to denote 
parameters that may be regarded as constant. Numerical tem- 
peratures given are only meant to be approximately correct 
values. 
Evaporative Control

$$M_0 = E + K_0(T_{s0} - T_a)$$

for

$$T_a > 31\ \text{deg C}$$

$$T_{s0} = \text{constant}\ (= 35\ \text{deg C})$$

See [4], for example, pp. 33, 518, 549; or [2], pp. 142, 205.

The implication has been permitted that $E$ becomes a function
of $T_a$, say, through the potential mechanism of increasing the
wetted surface area of the body.


Vasomotor or Conductance Regulation

$$M_0 = E_0 + K_0(T_s - T_a)$$

for

$$31\ \text{deg C} > T_a > 29\ \text{deg C}$$

See [4], for example, p. 548.

The implication has been permitted that the internal conductance
of the system changes, so that the skin temperature becomes
a function of ambient temperature.


Zone of Body Cooling

for

$$29\ \text{deg C} > T_a$$

The conditions of measurement are not at equilibrium. A heat
debt is built up. It is left by logical implication that the control is
through shivering and increased metabolism. See [2], p. 235,
next to last paragraph.

The defects in this general description are that it is not com- 
plete—it simply represents a partitional balance that contains no 
information about the dynamic state of the system; and the 
effective conductance is not satisfactorily treated—it does not 
lump all fluxes that may pass through the clothes. 

An elementary energy equation that may be written, more correctly,
is

$$M = L + K(T_s - T_a) + \frac{dH}{dt}$$

where

$M$ = metabolism
$L$ = rate at which heat is lost through all other
avenues than through clothes
$K(T_s - T_a)$ = rate of all sources of flux that pass through the
clothes
$dH/dt$ = rate at which thermal energy is stored in the
system

The assumption that all sources of flux passing through
clothes, regardless of their further partition, may be described by
an equivalent conductance is a hypothesis. Its general validity
may be borne out by careful study of [6], chapter 28. It is shown
there that there
is considerable similarity in the results of temperature difference
driving forces and concentration difference driving forces. Thus
under the assumption of small concentration gradients, the combined
radiative, conductive, convective, diffusive exchange at the
surface of the skin, between the skin and clothes, may be regarded
as functionally homogeneous in the temperature difference
$T_s - T_a$. However, this does not necessarily imply that the relative
contribution of each mode of heat exchange remains a constant
proportion of the whole (i.e., the equivalent conductance
is no longer necessarily a constant).

The second significant aspect of this description is that be- 
ginning to describe the system by means of a differential equa- 
tion creates the possibility of ultimate description of its dynamic 
performance. For example, this formulation states that the net 
rate of production of heat is instantaneously stored in the system. 
Since it is reasonable to assume that no phase change takes place 
instantaneously in the body, one may assume plausibly that the 
storage takes place in heat capacity of the system. Thus 


$$M - L - K(T_s - T_a) = C_p W \frac{dT_b}{dt}$$

where

$C_p$ = specific heat per unit weight (or mass) of the system
$W$ = weight (or mass) of the system
$T_b$ = volume averaged temperature of the body

To this must be added the thermoregulating assumption

$$T_r = \text{constant (say 37 deg C)}$$

$T_r$ = the average value (in time) of "deep body" temperature
(the regulated temperature)

This equation set is phenomenologically sound. It carries
with it the implication that, in a system which is regulated in some
sense (while not yet explicit, it is now possible for $T_s$ to make its
appearance as an as yet undetermined portion of $T_b$), a
self-powered system accepts a burden of heat storage in order to
regulate its controllable avenues of heat exchange around a set point.
If a more restrictive assumption is to be made, it may be, further,
that there is no other physically known means for regulating a
self-actuated system. Thus intrinsically, the regulation of a system
must be dynamic.

In order to assess the order of magnitude of experimental accuracy
that may be required for assessing the storage term, the
following numerical values are assumed (for a cold environment):

$M_0$ = 50 kcal/m² hr            $T_s$ = 31 deg C
$L_0$ = 10 kcal/m² hr            $W$ = 30 kg/m²
$K_0$ = 8 kcal/m² hr deg C       $C_p$ = 0.8 kcal/kg deg C

so that

$$40 - 8(31 - T_a) = 0.8 \times 30\,\frac{dT_b}{dt}$$

These numerical values have been chosen from data of Winslow,
Herrington, and Gagge (1937) to give a "storage" of about −40
kcal/m² hr at 21 C. This loss represents a drop in body temperature
of about 1.7 C/hr. To achieve an approximate desired
accuracy of about 4 kcal/m² hr requires a measuring accuracy of
the order of 0.003 C/min. Thus short term power measurements
on the human body require rate sensitivities of the order of 0.001
C/min in temperature measurement or control, which, perhaps,
are not so remarkable with regard to the precision of the temperature
measurement as they are with regard to the time stability
of the temperature measurement.
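
The arithmetic behind these figures can be reproduced with a short sketch. The constants are the assumed values listed above; the function and variable names are illustrative only.

```python
M0 = 50.0      # metabolism, kcal/m^2 hr
L0 = 10.0      # loss through avenues other than clothes, kcal/m^2 hr
K0 = 8.0       # equivalent conductance, kcal/m^2 hr deg C
T_SKIN = 31.0  # skin temperature, deg C
W = 30.0       # weight per unit area, kg/m^2
CP = 0.8       # specific heat, kcal/kg deg C

def storage_rate(t_ambient):
    """Net rate of heat storage, kcal/m^2 hr (negative means a heat debt)."""
    return M0 - L0 - K0 * (T_SKIN - t_ambient)

def body_temperature_rate(t_ambient):
    """Rate of change of volume averaged body temperature, deg C/hr."""
    return storage_rate(t_ambient) / (CP * W)

def rate_resolution_per_min(power_accuracy):
    """Temperature-rate resolution (deg C/min) needed to resolve a given
    storage power (kcal/m^2 hr)."""
    return power_accuracy / (CP * W) / 60.0

print(storage_rate(21.0), body_temperature_rate(21.0),
      rate_resolution_per_min(4.0))
```

At an ambient temperature of 21 C the sketch gives a storage of −40 kcal/m² hr and a cooling rate near 1.7 C/hr, and resolving 4 kcal/m² hr calls for a rate resolution a little under 0.003 C/min.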

In order to obtain some information about the long time scale
for equilibrium results, it is necessary to make some additional
assumptions. The crudest (single lumped time constant) model
of the system may be taken to be

$$M_0 - L_0 - K_0(T_s - T_a) = C_p W \frac{dT_b}{dt}$$

$$M_0 = C_0(T_r - T_s)$$


The second equation assumes that the system can be treated as
a central core in which temperature is well mixed and which has
a constant layer, for its outer core, of conductance $C_0$. This
model permits the assumption that:

$$T_b = aT_r + bT_s$$

$$a + b = 1$$

Validation for such a model may be taken from [2], p. 128, in
which Burton gives

$$T_b = 0.65T_r + 0.35T_s$$

Hardy and DuBois give

$$T_b = 0.8T_r + 0.2T_s$$

These models are equivalent to assuming fixed spatial zones of
temperature, in each of which the temperature distribution consists
of a separable spatial and temporal function. In fact [2], pp.
132-133, justifies a linear temperature distribution in the peripheral
zone and a flat distribution in the core, in the case at least of
the forearm and thigh. A linear model perhaps most validly permits

$$T_b = 0.75T_r + 0.25T_s$$


Since this result is close enough to the empirically justified 
measurements of Burton, and Hardy and DuBois, a two zone 
model may be regarded as having some merit. 
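
The three weightings can be compared side by side. In the sketch below the weights are those quoted in the text, while the dictionary, the function name, and the sample temperatures are assumptions chosen only for illustration.

```python
# (deep body weight a, skin weight b) for each two-zone weighting.
WEIGHTS = {
    "Burton": (0.65, 0.35),
    "Hardy and DuBois": (0.80, 0.20),
    "linear model": (0.75, 0.25),
}

def mean_body_temperature(t_deep, t_skin, source="Burton"):
    """Volume averaged body temperature as a weighted sum of the
    deep body and skin temperatures (weights sum to one)."""
    a, b = WEIGHTS[source]
    return a * t_deep + b * t_skin

for name in WEIGHTS:
    print(name, mean_body_temperature(37.0, 33.0, name))
```

For a deep body temperature of 37 C and a skin temperature of 33 C the three weightings differ by only a few tenths of a degree, which is the sense in which the linear model is close to the empirical ones.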

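Eliminating $T_r$ and $T_s$, respectively, from the three-equation set
results in

$$\frac{C_p W}{K_0}\frac{dT_s}{dt} + T_s = \frac{M_0 - L_0}{K_0} + T_a$$

$$\frac{C_p W}{K_0}\frac{dT_r}{dt} + T_r = \frac{M_0 - L_0}{K_0} + \frac{M_0}{C_0} + T_a$$

These single time constant results permit simple interpretation.
Let the single time constant $\lambda$ denote

$$\lambda = \frac{C_p W}{K_0}$$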

These equations then state that the skin temperature is driven 
by the temperature drop created by the metabolism across the 
external resistance, plus ambient temperature, with a lag that 
depends on a single time constant. The deep body temperature 
is additionally driven by the drop of temperature across the body 
resistance. 

Of interest here is the time constant

$$\lambda = \frac{C_p W}{K_0} = \frac{0.8 \times 30}{8} = 3\ \text{hr}$$


Thus the primary lumped element time constant of the human 
is of the order of three hours. No transient can be fully expected 
to die out in less than three hours. 
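
A minimal sketch of what a three-hour time constant implies for a step transient is given below. It assumes the purely passive single-time-constant response, in which a disturbance decays exponentially; the function names are illustrative.

```python
import math

def time_constant_hr(cp=0.8, w=30.0, k0=8.0):
    """Lumped time constant: heat capacity over conductance, in hours."""
    return cp * w / k0

def remaining_fraction(t_hr, lam_hr):
    """Fraction of an initial step transient still present after t_hr."""
    return math.exp(-t_hr / lam_hr)

lam = time_constant_hr()
for hours in (1.0, 3.0, 9.0):
    print(hours, remaining_fraction(hours, lam))
```

After one time constant somewhat more than a third of the transient is still present, and only after about three time constants has it fallen below five per cent.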

These equations are still not satisfactory for a description of the
system. If the system were a passive single lumped time-constant
system like a liquid thermometer bulb (in which, to simulate
correspondence, one may imagine a constant power heater
immersed in the mercury), the previous assumptions would be
reasonably well based and do, in fact, furnish the basis for
assessing performance of glass thermometers. The glass is the
resistance. It is assumed that the central core of mercury is well
stirred by convection. The mercury and glass make up the heat
capacity. The basic weakness in description, however, is that it is
not for a regulating system.

A better example might be taken of a thermostatted house, say, 
considered as a single lumped time-constant system to which has 
been added a simple on-off thermostatic ‘‘controller.”” One 
would find an exponential rise and fall in temperature; in re- 
sponse to a step function change of external temperature the on- 
off cycles would have to be observed for a time greater than, but of 
the order of, the longest lumped single time constant of the system 
as a whole before a steady-state on-off cycle were achieved. 
Roughly, these results may be regarded as a dynamic description 
of the simplest regulating systems. 
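
The behavior described for the thermostatted house can be illustrated with a small simulation. Everything numerical here (outside temperature, heater strength, set point, dead band, step size) is a hypothetical choice, and the simple Euler integration is only a sketch of a single lumped time-constant system with an on-off controller.

```python
def simulate_on_off(hours=24.0, dt=0.01, tau=3.0, t_outside=5.0,
                    heater_rise=30.0, set_point=20.0, band=0.5):
    """Simulate a single-time-constant house with an on-off thermostat.

    heater_rise is the steady temperature rise the heater alone would
    produce; band is the half-width of the thermostat dead band.
    Returns the temperature history and the number of switchings.
    """
    temp, heater_on = set_point, False
    temps, switches = [], 0
    for _ in range(int(hours / dt)):
        if heater_on and temp > set_point + band:
            heater_on, switches = False, switches + 1
        elif not heater_on and temp < set_point - band:
            heater_on, switches = True, switches + 1
        # Temperature relaxes exponentially toward the current target.
        target = t_outside + (heater_rise if heater_on else 0.0)
        temp += dt * (target - temp) / tau
        temps.append(temp)
    return temps, switches

temps, switches = simulate_on_off()
print(min(temps), max(temps), switches)
```

Each heating or cooling segment is a piece of an exponential with the house time constant, and the temperature cycles within the dead band around the set point.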

The literature was therefore searched for dynamic data taken 
on the human. It is suggested, by implication, that, in the 
vasomotor and evaporative zone, a passive equilibrium is achieved 
in which the deep body temperature remains constant. While 
time-dependent data (sparse in quantity) perhaps indicate
differently, there is no other useful interpretation given of the
presentation adopted by Winslow, Herrington, Gagge, and Hardy 
and DuBois in representing the partition of the avenues of heat 
versus operative temperature. 

The lack of pointed emphasis on the time domain of measure- 
ment and response, while stressing the reproducibility of results, 
permits only the logical inference of static or quasistatic re- 
sults. Thus it was possible only by detailed examination of a 
limited number of data to derive some a priori ideas on the true 
dynamic state of affairs in human thermoregulation. 

For example, in the zone of body cooling, it is stated that there 
is a loss in storage, and it is suggested that ultimately a rise in 
metabolism makes up for the loss. In particular, data of DuBois 
[4], p. 37 suggest that the regulation is that of a simple on-off 
controller in the zone of body cooling. More particularly, the 
figure suggests that the exponential rise and fall of an on-off con- 
troller operating around a thermostat set point occurs with a time 
constant of the order of an hour. 

The unsatisfactory state of a description of the dynamic charac- 
teristics of human thermoregulation therefore made it necessary 
to perform some primary and basic experiments on the human. 
The guiding assumption was made that what might be expected 
generally in any simple thermoregulated system is that the 
storage indicator—here the rate of change of mean body tem- 
perature—will give some information about the dynamics of the 
system. The level of mean temperature (definitely, the level of 
deep body temperature) becomes uninteresting. Control is im- 
plied in the rise and fall of mean temperature around the set 
point. However, in order to have consistency with the first law 
of thermodynamics, it is necessary that equilibrium in some 
period of the order of 3 hr be found. 

The average body temperature was the first parameter chosen 
for experimental investigation. A number of thermocouples, 
connected in series to sum up temperature readings, were in- 
serted in a subject resting quiescently, through all easily available 
re-entrant openings into a human. These included the urethra, 
the esophagus, and the skin itself. Number 40 copper-constantan 
thermocouple wire was used. 

The basic conclusion from these experiments (performed first 
in the regime of body cooling and then in the vasomotor and evap- 
orative regime) was that the “noise” far outweighed the ex- 
pected response to step functions. After exhausting most 
sources of possible criticism of experimental technique, it was 
finally concluded that this was not noise and that a real frequency 
spectrum of thermal oscillations existed. Specifically, there
appeared to be a number of temperature cycles with periods equal
to or longer than a 90-sec period. These faster cycles by far
masked any long time (say 3-hr) cycle. As another tentative
working hypothesis, it appeared more nearly true to regard skin
temperature as being regulated than not, although in some as
yet undisclosed sense. A similar tentative conclusion might be
drawn from the figure on p. 564 in [4].

Thus basically in these experiments, which were considerable in 
number, and which, beyond the major observations made, are 
not worth more extensive description, it was finally concluded 
that a static model of the thermoregulation of a human was hope- 
less; that a simple single time constant on-off controller model was 
hopeless; that in all temperature regimes there is a cyclicality 
which must be taken into account; that the dynamic equations 
must be nonlinear to account for the hard operating conditions 
(see [5] for background); that the periodicities approximately 
noted could not be explained by purely passive thermal elements 
with a constant heat of combustion source, and by simple ther- 
mal lags; that there is a considerable number of degrees of free- 
dom in the system; and that the sensing and control functions 
that take place in the body have not been adequately described 
in a phenomenological sense. 

These conclusions are critical of previous work in this field. 
The intent is not to suggest that previous work has had no merit, 
but to point up that the jig-saw puzzle of how thermoregulation 
is achieved requires rearrangement. A correct end solution must 
in good measure validate all previous work, but the new assembly 
of ideas must stand out to the reader, in particular to the physio- 
logical reader, in such sharp fashion that he can re-examine the 
physiological evidence and create suitable justifications of the re- 
assembled story, or criticize its indefensible elements. In this 
context, he must be sympathetic toward the physical arguments 
by which the story is reassembled. 

Therefore it is necessary to review the theoretical background 
in creating a suitable phenomenological model. The most precise 
energy equation that one may assume for the human body must 
be derived from a field equation. (This is the same procedure 
that, in principle, must be used and is used to “justify’’ Ohm’s 
law.) By starting with a hydrodynamic field energy equation in 
which chemical reactivity is permitted, and by adding suitable 
assumptions that seem to be obeyed in the body, a simplified 
equation is finally obtained in field form that may be integrated 
over the entire body. However, certain comments must be 
made. The body processes are intermittent (they certainly are 
keyed at breathing frequency). The problem is to replace them 
by an equivalent continuous process. The following model pro- 
visionally may be assumed as the most suitable mathematical 
boundary to take for the body that anticipates this problem. It 
is illustrated in Fig. 1. The boundary will be taken as a 
“smooth” boundary fixed in space, corresponding to the position 
of the skin averaged or filtered in some small interval of time, 
that has no resultant folding, and which is re-entrant into the 
lung cavity. This integration boundary now may be assumed 
to bound only liquid or solid. If the convention is taken that 
any other gas pockets are lumped into the lung cavity, then as a 
goodly approximation, one may assume the enclosed body sub- 
stance to be incompressible. 

Integration over the entire fixed volume envelope then results in

$$W C_p \frac{dT_b}{dt} = H + k A_e \frac{\partial T}{\partial n}$$

with respect to the inside boundary of the envelope, where

$W$ = weight (or mass) of the body
$C_p$ = specific heat of average body tissue
$T_b$ = averaged (over liquid and solid) volume weighted temperature
throughout the body envelope
$H$ = total heat of combustion (averaged over the filtering
time interval) produced in the body
$k$ = body conductivity at the inner surface of the envelope
$\partial T/\partial n$ = normal surface averaged temperature gradient over the
inside surface of the envelope
$A_e$ = area of the fixed "smooth" volume envelope

These terms now must be assessed at the exterior boundary. 

The conductance term is assessed by continuity of flux at the 
boundary. It may be broken into two parts, one over the outer 
skin, and the other re-entrant into the lung cavity. 

Consider first the loss from the lung cavity. The heat exchange 
there is intermittent. However, by the concept of the instanta- 
neous filtered average, it can be replaced by an equivalent con- 
tinuous heat exchange. The filtered average of an equivalent 
continuous flow process is represented by the heating of the total 
wet-gas equivalent of inspired air (the minute volume) assessed 
as all air, from ambient temperature $T_a$ to the mean deep body
(constant because thermostatted) temperature $T_r$, plus the latent
heat of evaporation required to saturate the expired gas. 

It may be shown, ultimately, that the approximate heat loss
from the lungs is given by

$$0.095\,[1 - 0.011(37 - T_a)]\,M$$

where

$M$ = metabolism (as measured at a measuring station)

assuming zero humidity ambient air.
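As a quick numerical illustration, the lung-loss expression can be evaluated directly. This is a minimal sketch that takes the expression exactly as printed and assumes the temperature in it is the ambient temperature in deg C; the function name and the 100 W metabolic rate are hypothetical and chosen only to exercise the formula.

```python
def lung_heat_loss(metabolism: float, ambient_temp_c: float) -> float:
    """Approximate heat loss from the lungs, in the same units as `metabolism`.

    Evaluates 0.095 * [1 - 0.011 * (37 - T_a)] * M, the expression quoted in
    the text for zero-humidity ambient air. T_a is assumed to be in deg C.
    """
    return 0.095 * (1.0 - 0.011 * (37.0 - ambient_temp_c)) * metabolism


# Hypothetical metabolic rate; the temperatures are simply sample inputs.
for t_a in (12.0, 26.0, 35.0):
    loss = lung_heat_loss(100.0, t_a)
    print(f"T_a = {t_a:4.1f} C -> lung loss = {loss:.2f} (units of M)")
```

Because the loss is proportional to the metabolism, it can be treated as a fixed fraction of M at a given ambient temperature.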

The loss through the outer skin, by the ideas previously discussed,
may be written as

$$h\,(T_s - T_a)\,A_s$$

where

$h$ = over-all effective heat-transfer coefficient
$A_s$ = external smooth boundary area

[Fig. 1 Prototype of a stationary model for the mathematical boundary
of a human (re-entrant into the lung cavity). Figure labels: re-entrant fold boundary; detail on the convention of “smoothness”]


The heat of combustion H is the integrated total instantaneous 
filtered average. However, for the process of measuring, the 
measuring or the control station—if the body controls the 
metabolism—most probably is located at discrete points. Thus 
there are process lags. Most probably these transport lags are 
distributed lags. However, even the simplest model (a single- 
time-constant model) would require that the measured metabo- 
lism M (say as measured by gas exchange in the lungs) and the 
heat of combustion H are related by 


$$M = H + \lambda_1 \dot{H}$$

where

$\lambda_1$ = a single time constant

or more generally

$$M = (O + 1)H$$

where

$O$ = a suitable differential operator. Its necessary property
is that the mean value of $O(H)$, in time, is zero, so
that $M$ and $H$ average to the same value
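The averaging property of the single-time-constant model can be checked numerically. The sketch below assumes a sinusoidal heat of combustion, which is a choice made here for illustration and not a waveform taken from the paper; the function names and all numerical values are hypothetical.

```python
import math


def measured_metabolism(h_mean, amplitude, period_s, time_constant_s, t):
    """M(t) = H(t) + lambda * dH/dt for an assumed sinusoidal H(t).

    H(t) = h_mean + amplitude * sin(2*pi*t / period_s) is an illustrative
    waveform only; the derivative is taken analytically.
    """
    omega = 2.0 * math.pi / period_s
    h = h_mean + amplitude * math.sin(omega * t)
    h_dot = amplitude * omega * math.cos(omega * t)
    return h + time_constant_s * h_dot


def mean_over_period(func, period_s, samples=1000):
    """Average func(t) over one full period using equally spaced samples."""
    dt = period_s / samples
    return sum(func(i * dt) for i in range(samples)) / samples


# Hypothetical values: 100 W mean, 20 W swing, 7-min cycle, 60-s lag.
m_mean = mean_over_period(
    lambda t: measured_metabolism(100.0, 20.0, 420.0, 60.0, t), 420.0
)
print(f"mean of M over one cycle = {m_mean:.6f}")
```

The derivative term shifts and enlarges the oscillation seen at the measuring station, but it contributes nothing to the average over a full cycle.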


Thus the simplest possible equation might be illustrated, sche- 
matically, by 


$$W C_p \frac{dT_b}{dt} = (O + 1)M - fM - K(T_s - T_a) + O_1(T_s) + O_2(T_a)$$


where


$f$ = fraction of metabolic heat that does not pass through
the clothes
$O_1$, $O_2$ = suitable differential operators on $T_s$ and $T_a$, respectively.
These terms average to zero in time


If it is assumed that these equations are linear, except for the 
clothing term, then one might expect, either exactly or ap- 
proximately, that the time average of this equation would result 
in 


$$\bar{K}(\bar{T}_s - \bar{T}_a) = \bar{M}(1 - f)$$


The bar notation denotes time average. These equations replace 
the looser definitions given previously. 
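The time-averaged relation can be rearranged to give the conductance as the averaged metabolism, less the fraction that bypasses the clothes, divided by the averaged skin-to-ambient temperature difference. A minimal sketch of that arithmetic follows; the function name, the units, and the sample values are assumptions made for illustration and are not data from the paper.

```python
def clothes_conductance(mean_metabolism, bypass_fraction,
                        mean_skin_temp_c, mean_ambient_temp_c):
    """Time-averaged conductance = M * (1 - f) / (T_s - T_a).

    All inputs are long-time averages. `bypass_fraction` is f, the fraction
    of metabolic heat that does not pass through the clothes.
    """
    temp_difference = mean_skin_temp_c - mean_ambient_temp_c
    if temp_difference == 0.0:
        raise ValueError("conductance is undefined for zero temperature difference")
    return mean_metabolism * (1.0 - bypass_fraction) / temp_difference


# Hypothetical averages chosen only to exercise the formula.
k_bar = clothes_conductance(100.0, 0.09, 33.0, 21.5)
print(f"conductance = {k_bar:.2f} W per deg C")
```

The result is only as meaningful as the averaging period behind the inputs, which is the point the text goes on to make.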

As a matter of fact, this is as far as one may go a priori. These 
equations are now thermodynamically correct. They say that 
in a cyclic system, which hunts around a regulated variable (not 
now necessarily a conservative system), there exists a modest 
period of time (if not too long so that the loss in weight is not 
excessive) over which equilibrium energy exchanges should take 
place. In the dynamic equation, the possibility exists of a variety 
of coupled sources of oscillation. 

Thus while the time-averaged over-all thermal equation holds 
promise for a definition of the effect of a load, the problem still 
remains in what true sense may this equation be used. It may 
not be used during one breathing cycle (say, 4 seconds). The 
previous estimate indicated that it might have to be used over a 
period as long as three hours. Thus in order to be able to pro- 
ceed with any confidence, in the best sense in which to use the 
time-averaged equation, one must have at least a phenomeno- 
logical basis for the true dynamic equations. Translated into 
“practical” terms (at least for a mathematical physicist) this 
means that one must be able to discuss in physical or mathemati- 
cal terms the degrees of freedom of the system, the types of 
mechanisms, and their characteristics that might go to make up 
the governing nonlinear differential equation. It is only by a 
description of the dynamic thermoregulation of the human that 
the effect of clothes as a load on the system can be understood. 

The sparse dynamic data in the literature were reviewed again.
It appeared quite clear that, in some undisclosed sense, system
variables seemed to include metabolism, the effective wetted-surface
area, the body conductance, and the skin temperature.
Since the experimental data that had been taken indicated considerable
“high-frequency” harmonic content (at periodicities in
the minutes level) that, by far, masked the low single-time-constant
frequencies (cycles in the hours range) and that could
not be ascribed to “noise,” the basic problem seemed to lie in
accounting for high-frequency phenomena. It is believed that
a major conceptual breakthrough occurred from considering the
problem in this light.

It is already implicit that the real time constants (whether 
zonal or not) are too slow (the fluid-exchanger system is of no 
help against the large capacitance of the system and boundary 
resistances). On the other hand, the electrical (nervous) signaling
is much too fast, as is the fluid-circulation system. There
is an intermediate range of modest speed that must get at time 
constants associated with fluid transport, and slower diffusive 
processes. The problem was where to find a mountain from 
which to view the terrain. Very plausibly one may start from the 
zonal concept. The implication in

$$T_b = aT_r + bT_s$$

and in particular that

$$T_b = 0.75\,T_r + 0.25\,T_s$$


is that a considerable degree of linearity in the variation of skin 
temperature through the outer periphery and flatness in the inner 
periphery exists. More particularly, the approximate mean 
equivalent depth over which linearity would be required in the 
skin zone is 1 in. The first immediate thought, also suggested in
the literature, is that this would correspond to the “peripheral
vasoconstrictive” zone of the body; i.e., a zone that extends
through the fat layer to the fascia. Direct clinical evidence appeared
to exist in [2], pp. 132, 133, of temperature probes into
the forearm and thigh, and some other incidental data in depth
surveys through the anus, vulva, possibly the urethra and
esophagus. Mouth data indicate that “in a short distance” one
is “substantially” up to deep body temperature. The first reference,
however, is the most important in establishing in some
quantitative sense, a real gross linearity in temperature through 
the outer zone. One significant detail was originally overlooked 
which will be soon mentioned. To a crude extent the limits of 
linearity extended to the order of magnitude of the fascia. Con- 
sistency is thus achieved with regard to a vasoconstrictive-control 
concept that brings the vasomotor zone into compatibility with a 
two-zone concept of temperature distribution. However, here 
arises a major physical paradox. A linear (this includes a flat) 
temperature zone implies negligible distributed heat production 
(at least averaged over time). The presence of a distributed 
fluid-exchange system could only go to produce a stationary 
average instantaneous distribution (i.e., temperatures may only 
rise and fall in unison) in the radial dimension. In any case then 
the heat production must be concentrated in the transition zone 
between the linear temperature distribution of the outer zone, 
and the flat temperature distribution of the inner zone. Thus 
there must intrinsically exist three zones, with a relatively con- 
centrated middle zone. Structurally, the only mechanism that 
could exist at that depth was the muscle system! Thus all of a 
sudden another piece of the jig-saw puzzle fell into place. Re- 
examination of the previous data now began to make sense. 
The end of the linear temperature zone is not “in the vicinity of
the fascia,” but is “in the vicinity of the middle of the muscle
system.” As a physicist reporting, any responsibility is disclaimed
for the validity of the following concept. However, a
biological colleague pointed out that basically one could regard
the human system from its embryological development, as made
up of three systems, in which the “muscular system” is part of the
mesodermic layer. Thus from a physical point of view, it began
to make considerable sense that the major source of heat production
and control is lodged in the “muscular” system, and that this
should not be interpreted as an equivalent uniform sheath
geometrically located (except for equivalent model purposes) but
as a system roughly organized as a sheath with re-entrant pocket
attachments to such mesodermic organs as the brain, liver, heart,
and so on (which have always been noted as having slightly
higher temperatures than deep body temperatures). It is obvious
that the reason for such organization for other biological pur- 
poses is not in the domain of physics so that it is simply ac- 
cepted as an empirical fact. However, a much more palatable 
picture is now obtained, from a control point of view, of how an 
inner zone is regulated with a flat temperature distribution inside. 
It is the same system that would be used in a laboratory. An 
inner zone consists of a well-stirred liquid bath, with possibly a 
minor amount of immersion heater distributed for fine control. 
However, major control lies in an electric heater wound around 
the inner zone. By virtue of its forcing a conductance of heat 
inward (it is at higher temperature) it creates a concave upward 
distribution of temperature into the center. Modest central heat- 
ing creates a concave downward distribution. Thus a relatively 
flat central distribution is achieved, by virtue of a considerable
degree of harmonic compensation (as in a Helmholtz coil magnetic
field, there is considerable ripple but it averages about a constant 
value). Further plausible details in a laboratory thermal con- 
tainer are a well-stirred liquid bath, a high degree of insulation 
outside, and at least a modest degree of temperature regulation 
of the outer layer. 
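The paradox noted above, that a linear (or flat) temperature zone implies negligible distributed heat production, follows from a steady conduction balance. The derivation below is a sketch under assumptions not made in the text: a one-dimensional slab with depth coordinate $x$ measured outward from the core, a constant conductivity $k$, a volumetric heat production $q(x)$, and a thin source layer of strength $Q$ per unit area at depth $x_m$. All of these symbols are introduced here for illustration only, and the distributed fluid-exchange system is neglected.

```latex
\begin{align}
k \frac{d^2 T}{dx^2} + q(x) &= 0
  && \text{(steady conduction with a distributed source)} \\
T(x) = c_0 + c_1 x \;\Rightarrow\; \frac{d^2 T}{dx^2} = 0
  \;&\Rightarrow\; q(x) = 0
  && \text{(a linear or flat zone contains no source)} \\
k \left[ \left.\frac{dT}{dx}\right|_{x_m^+} - \left.\frac{dT}{dx}\right|_{x_m^-} \right] &= -Q
  && \text{(gradient jump across a thin source layer)}
\end{align}
```

With a flat inner zone the gradient just inside the layer is zero, so the gradient just outside it is $-Q/k$: the whole flux carried through the linear outer zone originates in the concentrated middle layer, which is the three-zone picture described in the text.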

Ascribing now the “major source of heat production” into the
“muscular” system forces another consequence. The heater must
be controlled in a lively fashion. In an electrical heater 
system with an external power supply, this would be easily 
achieved by a rheostat control permitting easy and rapid swings 
in power level. Unfortunately in the body system, what is needed 
is that the power supply or prime mover (i.e., the steam plant, 
turbine, generator) must be part of the system. Equilibrium
reaction chemistry where the temperature is quite regulated, so
that wide rate changes cannot occur from temperature changes, 
preclude equilibrium reaction-rate shifting, although transport 
of catalyzers triggered by nerve impulses have more plausibility. 
However, the basic problem is that, if chemical plant equilibrium 
(in the cells) exists in a limited sense, a ready supply of fuel must 
be made available and converted at a temperature equilibrium 
rate into heat. The basic physical way of translating this is that 
the only thing that can respond fast enough is a heat engine! In 
analog to a system which is beginning to come into prominence in 
aircraft, what must happen is that an engine must be running at 
an idling rate (producing heat just at its standby frictional level— 
i.e., the cells are being ‘fed’ energy); and in response to a demand 
for heat, a work load must be thrown onto the engine. Thus 
the engine is always available for doing work. However, this 
work is not yet available as heat. The work, however, is called 
for by a device—say a Prony or an electric brake—which im- 
mediately degrades this work into heat. In the body, pre- 
sumably the muscles become a chemical heat engine in which the 
entire chemical panoply of reactions (phosphate reduction, regeneration,
and so on) is taking place through the “mechanical”
moderation of internal muscular activity. Thus the best point
of view one might postulate is that the muscles are always
“shivering” subcutaneously, degrading internal mechanical work
into heat in proportion to signaling demand. This then pro- 
vides a lively enough engine (i.e., in the minutes or even seconds 
scale; however, information for electrical signaling may arrive 
only at a diffusive time scale so that a time scale in the minutes 
range only may be achieved). Thus then, metabolism is a
lively changing quantity rather than a static constant quantity,
that may be running standby in part of an external temperature- 
load range but is ready to go into action to produce ready heat if 
it gets too cold or if external work is required. The muscles are 
thus always in an active state of operation. 

Thus an equivalent stationary temperature distribution has 
now been validated as describable by 


$$T_b = aT_r + bT_s$$


but with a new interpretation. The distribution derives from 
three zones, not two, and these are only averages in time. It is 
not necessarily true for the instantaneous filtered average; i.e., 
time lags of modest degree may exist. This could mean specifi- 
cally that terms involving mean body temperature or its integral 
may be replaced mathematically by a linear combination of skin 
and rectal temperature, but not under the mathematical operation 
of differentiation. 

These remarks are as far as one may go toward establishing a 
satisfactory a priori model for measuring clothes conductance. 


Summary 


In order to know how to measure the thermal “resistance” or
“conductance” of clothes as they affect the human, it is necessary
to know something about the physiological physics of the human
as a potential heat source. On one hand, the large heat capacity
of the system and rough estimates of the thermal resistances
suggest that a period of time of the order of 3 hr may be required
for thermal equilibrium. On the other hand, experimental 
measurements have indicated a considerable number of important 
sustained thermal and thermal power oscillations with periods as 
short as 90 sec that mask long-time adjustments. These point 
up what may be expected in a self-actuated regulated (here 
thermoregulated) system that the system must be nonlinear, and 
that equilibrium data from such a system can only be obtained 
by operating the system in some rationally understood mode. 

This preliminary study has suggested that the only hope at 
present for determining the conductance of a set of clothes is 
according to the definition 


$$\bar{K} = \frac{\bar{M} - \bar{L}}{\bar{T}_s - T_a}$$

where

$T_a$ = constant ambient temperature
$\bar{T}_s$ = average external skin temperature, averaged over a
long period of time (at least three hours)
$\bar{L}$ = thermal power losses that do not pass through the
clothes
$\bar{M}$ = metabolic power generated by the system, averaged
over a long period of time
$\bar{K}$ = total average effective conductance of the entire sheath
(clothes plus air) surrounding the human


Since the major loss that does not pass through the clothes is
the loss from the lungs, which is essentially a constant fraction $f$
of the metabolism, this expression may be written

$$\bar{K} = \frac{\bar{M}(1 - f)}{\bar{T}_s - T_a}$$


It has not been demonstrated or proved how the system should 
be operated or monitored to give the conductance a rational mean- 
ing since the dynamic characteristics of the various control ele- 
ments of the body have not been described. However, it has 
been possible to begin to create a rational model for the system 
that would begin to account for the high frequency power oscilla- 
tions found experimentally and to account for the success of a 
two-zone temperature model that has been used to describe the
human system (a linear temperature distribution in the outer
body sheath and a flat distribution in the core). It has been 
postulated that there are three zones in the body, the outer 
vasoconstrictive zone, the inner core, and a middle (mesodermic) 
“muscular” system in which the major metabolic heat is pro- 
duced. This muscular system is always “shivering” and produc- 
ing thermal power oscillations as a heat engine. This very active 
middle layer may therefore restrain core temperature in a time 
domain much shorter than 3 hr. Since these power oscillations 
must be limited by some as yet undisclosed control function, the 
possibility of determining clothes conductance is enhanced if a 
rational external work schedule can be arrived at for the system. 
At this point none is suggested.


Human Body as an Inconstant Heat Source
and Its Relation to Clothes Insulation


Part 2: Experimental Investigation Into
Dynamics of the Source¹


Quantitative measurement on the human in the so-called evaporative, vasomotor, and 
metabolic control regimes has revealed a frequency spectrum of sustained thermal power
oscillations with approximate periods of 2, 7, 35 min, and 3 1/2 hr independent of the
regime. Step function adjustments take place with a time constant of about 7 min. It
is believed that the 3 1/2 hr cycle represents the shortest equilibrium cycle. The hypothesis
that it might be possible to measure the resistance of clothing as an ohmic relation among 
time-averaged equilibrium values, and for a specific mode of operation of the system has 
now been put in rational context in the time domain. Two equilibrium modes of the 
human system were explored. The active mode of operation of the system, to which the re- 
sistance concept of clothes is most applicable, is as a feedback system in which the 
extremities are used as error indicators of deviations from a comfort level set point. In 
response to deviations, the human feeds back a signal to generate an activity level in which 
only internal work, immediately degraded into heat, is done to maintain the comfort
level. This is referred to as the comfort mode of operation of the system. Another
mode of operation of the system is also described.


In an earlier paper [1],² some tentative characteristics
were derived for a dynamic model of the human as a heat
source. It was suggested that, if an equilibrium thermal cycle of
the order of 3 hr could be found in the human, it would be possible
to determine the conductance of clothes as an ohmic relation
among time-averaged equilibrium values of flux and thermal
potential. However, the nonlinear operating characteristics
of a self-actuated (here thermoregulated) system require
specification of the mode of operation of the system. In the
usual quiescent mode of operation of the system described in the
literature, it is not clear how the independent variables of
metabolism, evaporative heat loss, skin temperature, and body
conductance vary dynamically to establish a long-time equilibrium.
Particularly, it is not clear how these variables adjust
as a load on the system (here clothes) is changed. The purpose
of this study was to determine experimentally how the human
system may be operated to obtain rational and comprehensible
values for the conductance of clothes.

¹ This is the second of three papers that describe a methodology for
measuring clothes insulation.

² Numbers in brackets designate References at end of paper.

Contributed by the Instruments and Regulators Division and presented
at the Instruments and Regulators Conference, Cleveland,
Ohio, March 29-April 2, 1959, of The American Society of Mechanical
Engineers.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
22, 1959. Paper No. 59-IRD-8.


Experimental Exposé of Dynamic Thermoregulating
Characteristics of the Human


The previous paper [1] touched on some preliminary qualitative
experiments made on the harmonic content of average body
temperature. At this point, it was timely to attempt to measure
experimentally and quantitatively what the harmonic content of
the system might be. It is already implicit in treating any complex
system that such harmonic content would make itself evident
in the response of any variable that could clearly show the
dynamic characteristics of the system. In particular, it was desirable
that changes should be shown in the steady-state level
with step-function changes of load, due either to changes of external
temperature or changes of external clothes. Because of
the considerable doubt cast on clothes measurements, it was de- 
sirable that the simplest step changes possible be explored. Thus 
ambient temperature as a step variable was chosen, under a con- 
dition of wearing a constant set of light clothing. Since doubt 
had been cast already on whether skin temperature was or was 
not a controlled variable, and since both skin temperature and 
rectal temperature required considerable sensitivity of measure- 
ment, i.e., sensitivities of the order of 0.001 deg C per min, which 
could raise doubts in the reader’s mind as to the measurement 
accuracy, the measurement of skin temperature was excluded.
(However, it is worth calling the reader's attention to the experimental
observation made in dynamic measurements of skin
temperature that as one regulated a sensitive potentiometer to 
null balance on a subject, it was easily evident that a cycle of 
perhaps 90 sec was going on in skin temperature. The observer 
would soon almost begin to feel the “pulse” of the cycle.) Thus
“instant metabolism” was chosen for the variable to be explored.
Subject, of course, to criticism, actually instantaneous
air consumption was chosen for the variable to be explored. This 
is heuristically referred to as metabolism on the theoretical as- 
sumption, checked by experimental validation with an oxygen 
partial pressure meter (Pauling meter) measuring in the exhaled 
breath, that the variation in partial pressure at say the end com- 
ponent of each breathing cycle reflected changes that were only a 
small modest percentage of the changes indicated in air consump- 
tion (the metabolism may be computed loosely as the product of 
air consumption and the fractional change in partial pressure of 
oxygen from inspired tank air to expired air). The basic system 
chosen was thus an open system measuring demand on the high-pressure
supply side. This was chosen because of the cleanness
of time lags, and the precision with which rapid changes could be 
easily detected. 

The basic supply system thus consisted of a high-pressure tank 
of known volume, a precision pressure gage teed into the tank 
(accuracy 0.1 per cent), a pressure regulator, a shutoff valve, a 
demand regulator, a known volume and precision pressure gage 
teed in between the demand regulator and shutoff valve, or a 
high-pressure, differential-pressure flowmeter teed into the same 
point, and a small demand mask. 

The first basic experiments were designed to determine the time 
domain in which significant dynamic phenomena occurred. Thus 
in a first experiment, a small volume was chosen for the volume 
intermediate between the shutoff valve and demand regulator. 
This volume was repetitively charged to about 100 psig from the 
supply tank each time the pressure fell to 50 psig (the minimal de- 
sirable supply pressure for the demand regulator). Each breath 
would lead to a significant decay of pressure; i.e., about 50 
breaths would be taken per 50 psi charge. By plotting supply- 
volume pressure at say the end of exhalation against time, seg- 
mental portions of pressure decay, each approximately 200 sec 
long were obtained. From slopes, the “instantaneous,” i.e.,
averaged over one, really one or two breaths, air consumption
could be obtained. From a few hours of such data, it was ascertained
that nonrandom cycles in the 1 1/2 to 2-min range, in the
5 to 8-min range, in the 25 to 50-min range, and possibly in the
3 to 5-hr range, existed. Randomness of cycle commensurate
with the experimental accuracy, in the two or three breathing
cycles range of time, seemed to exist. In particular, it appeared 
that the first reasonably “solid” response could be obtained by
averaging over approximately five breaths, i.e., over an approximate
period of time ranging from 20 to 30 sec. Thus this represents
as a first approximation the minimum period of time or the
number of breathing cycles over which one must integrate for
“instantaneous filtered averages.” The dilemma
that this sampling period presents is that one is caught between
choosing a period not so short that the “random high-frequency
noise” (“random” erraticness, if any, of individual
breathing cycles) will disturb results, and yet not so long
that the first significant harmonic
(which appeared always to lie in the 90 to 130-sec range)
cannot be determined with reasonable accuracy. Thus all subsequent
work oscillated in a choice of sampling period referenced
either to a 20 or the 30-sec sampling period.

Segmental sections of pressure decay for about 5000 sec are 
shown in Fig. 1. Also shown are the results of differentiation ob- 
tained by taking running five-breath averages. Cyclicality is 
quite evident. 
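The differentiation step can be sketched numerically. The original work was done graphically; the code below is a hypothetical stand-in that estimates the decay rate from the pressure difference across a running window of five breaths. The function name and the synthetic data are assumptions made here.

```python
def running_decay_rate(times_s, pressures_psi, window=5):
    """Estimate decay rate (psi/min) over a running window of `window` breaths.

    `times_s` and `pressures_psi` are readings taken at the same point of each
    breath (for example the end of exhalation). Each result is a pair of
    (window midpoint time in seconds, decay rate in psi per minute).
    """
    rates = []
    for i in range(len(times_s) - window):
        dt = times_s[i + window] - times_s[i]
        dp = pressures_psi[i] - pressures_psi[i + window]
        midpoint = 0.5 * (times_s[i] + times_s[i + window])
        rates.append((midpoint, 60.0 * dp / dt))
    return rates


# Synthetic example: one breath every 4 s, 1 psi lost per breath.
times = [4.0 * i for i in range(11)]
pressures = [100.0 - 1.0 * i for i in range(11)]
for midpoint, rate in running_decay_rate(times, pressures):
    print(f"t = {midpoint:5.1f} s  decay rate = {rate:.1f} psi/min")
```

Differencing across the whole window is equivalent to a duration-weighted average of the single-breath slopes, which is why it suppresses breath-to-breath scatter while still following cycles of a minute or more.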

In order to insure that the results achieved by graphical dif- 
ferentiation were real, i.e., to free these results from any question, 
an instantaneous averaging flowmeter was constructed and in- 
stalled just upstream of the demand regulator. The shutoff valve 
was eliminated and replaced by a rather large volume to help filter 
out any pressure-reducer pressure ripple. (The reducer was 
chosen and set to maintain an extremely constant supply pres- 
sure). The instantaneous flowmeter consisted of a carefully 
designed linear viscous flowmeter of parallel capillary tubing 
whose response would be very purely proportional to volume flow 
at the constant supply pressure. The differential-pressure gage 
used to indicate flows (at a high absolute pressure, which was 
varied from 50 to 150 psig to change flow sensitivity) was care- 
fully fitted with a balanced combination of linear pneumatic re- 
sistances and capacitances to produce differential pressure indi- 
cating lags at levels in the 10 to 40 second range. Thus the 
differential pressure was averaged (or filtered) over corresponding 
periods of time. By plotting the flows thus indicated, the reality 
of the dynamic cycles was validated, at least for the faster cycles.

The absolute accuracy of this instrument, in particular the differential-pressure
gage, did not warrant considering it for primary
use, since too wide a differential scale range is required (it must
practically extend to cover peak breathing flows, say perhaps in
the 60 to 90 lpm range), while at the same time requiring high percentage
accuracy at low mean level flows (which might be at the
3 to 4 lpm flow level). To cover the 20 to 1 flow range (in the dynamic
measurements made, at least 8 to 1 variations in instantaneous
filtered flows were found) required rather frequent changes
in sensitivity, achievable by changing the absolute pressure, so 
that these measurements were interpreted only as rough quantita- 
tive indicators of the reality of cyclic changes. From these ex- 
periments, it was only possible to confirm that either the 20 or 
30-sec averaging period had comparable probity in indicating the 
“100” second cycles and, since the 20-sec period still showed 
modest breathing pulsation, the 30-sec period seemed to have a 
little more merit; i.e., 7 breaths are perhaps a shade better than 
5-breath averages. 
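The averaging action of the lagged differential-pressure gage can be imitated with a discrete single-time-constant filter. This is a sketch under the assumption, made here and not stated in the text, that the pneumatic resistance-capacitance network behaves as one first-order lag; the function name and sample values are hypothetical.

```python
import math


def first_order_lag(samples, dt_s, time_constant_s, initial=None):
    """Apply a single-time-constant low-pass filter to equally spaced samples.

    Uses the exact discrete step response of a first-order lag:
    y += (1 - exp(-dt / tau)) * (x - y).
    """
    alpha = 1.0 - math.exp(-dt_s / time_constant_s)
    y = samples[0] if initial is None else initial
    filtered = []
    for x in samples:
        y += alpha * (x - y)
        filtered.append(y)
    return filtered


# Step input sampled once per second, filtered with a 20-s lag.
response = first_order_lag([1.0] * 60, dt_s=1.0, time_constant_s=20.0, initial=0.0)
print(f"fraction of step reached after 20 s: {response[19]:.3f}")
```

A lag of 20 to 30 sec passes cycles near 100 sec with only modest attenuation while smoothing individual breaths, which matches the trade-off described for the two averaging periods.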

From these results, the form of the first primary quantitative 
experiment was cast. A decay volume was rechosen (approxi- 
mately 2 liters) for coupling into the line between the shutoff 
valve and the demand regulator. A precision 300-psi test gage 
was chosen for indication. This instrument permitted readings 
with a sensitivity of about 1/2 psi, and an accuracy of about 1 psi.
If an approximate consumption level of 5 lpm ntp, and 15 breaths
per min is assumed, then a decay cycle (from 200 psi to 50 psi) of
about 4 min is obtained, permitting approximately eight 1/2-min decays
of about 20 psi each. Thus the accuracy of an individual
reading is of the order of 5 per cent. Since each refilling
of the cylinder from 50 psi to 200 psi involves an adiabatic
heating of the gas, the time for re-establishment of “good enough”
equilibrium was investigated and found to be about 30 sec. It
was thus decided that a serial recording would be made at 30-sec
intervals in which a recording interval would be skipped each
time the cylinder decayed to the vicinity of 50 psi and, instead,
the cylinder would be refilled and the time permitted for equilibrium.

Fig. 1(a) Air consumption of human subject at 26 C (by decay of supply
pressure in a fixed volume)
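The decay-cycle figures quoted above can be checked with ideal-gas arithmetic. The sketch assumes isothermal decay and a standard atmosphere of 14.7 psi for converting the pressure drop to free-air volume; both assumptions, and the function names, are introduced here for illustration.

```python
ATM_PSI = 14.7  # assumed standard atmosphere, used to convert to free-air liters


def decay_cycle_minutes(volume_l, p_start_psi, p_end_psi, consumption_lpm):
    """Minutes for a fixed volume to decay between two pressures (isothermal)."""
    free_air_l = volume_l * (p_start_psi - p_end_psi) / ATM_PSI
    return free_air_l / consumption_lpm


def pressure_drop_per_interval(volume_l, consumption_lpm, interval_min):
    """Pressure drop in psi during one recording interval."""
    return consumption_lpm * interval_min * ATM_PSI / volume_l


cycle = decay_cycle_minutes(2.0, 200.0, 50.0, 5.0)
drop = pressure_drop_per_interval(2.0, 5.0, 0.5)
print(f"decay cycle = {cycle:.2f} min")
print(f"drop per 1/2-min interval = {drop:.1f} psi")
print(f"relative error for a 1-psi gage accuracy = {1.0 / drop:.3f}")
```

The results agree with the rounded values in the text: a cycle of about 4 min, roughly eight half-minute readings of about 20 psi each, and a reading accuracy of the order of 5 per cent.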

The primary experiment performed was a step function of a 
young “healthy” athletic male (18 years old) wearing light 
clothes from a “normal” temperature environment of about 75 F 
and activity level to a quiescent level, while lying horizontally in a 
small room held constant at a series of preselected tempera- 
tures. The bed chosen was an open network of a minimal amount 
of '/,-in-diameter hemp rope and '/s-in-diameter clothes line in a 
wooden box frame mounted about 2 ft from the floor. The bed 
was designed to “give” very little so that the chance for much
vertical motion was quite small, and to possess very little heat 
capacity or too good thermal conductivity. In other words, the 
subject was mounted as close as possible on a nonyielding mass- 
less plane, with instructions to move about minimally. Tem- 
perature control in the room was achieved by use of a judicious 
choice of room air conditioners, heating elements, and fans to give 
a broad circulation pattern in the room with only modest breeze 
over and under the subject. The subject was shielded from 
radiant surfaces at higher temperature, and as many other minor 
precautions were taken as possible, to assure that to a large extent
the subject was immersed in a “black-body enclosure” of a temperature
corresponding to air temperature. Temperature control
was manually adjusted to as close to 0.1 deg C as possible. Thus
the basic interpretation that these experiments permit is relative
data on the dynamic course of air consumption of a “constant”
quiescent subject wearing “constant” clothes in a
“constant” temperature at various temperature levels. One experiment
was done each day starting under comparable conditions of
time, outside temperature, and diet. Each experiment was run
for somewhat more than 5 hr.


To a cruder extent the air consumption is a mirror of metabolic 
change. To verify this, a relatively rapid responding Pauling 
meter (with time delays of the order of '/: sec) was connected 


into the outlet of the demand mask in a tube furnished with a 
light check valve; the check-valve resistance was chosen to be 
sufficient to bleed a sample of exhaled gas through the Pauling 
meter. The oxygen partial pressure was noted at the end of each 
expiration cycle and averaged over five breaths. By multiplying 
the instantaneous air consumption by the oxygen partial-pressure 
change from inhalation (tank) oxygen partial pressure and the 
exhalation partial pressure, a quantity very nearly equal to the 
oxygen derived metabolism is obtained. This was done for 
selected experiments to note that there is no tremendous dif- 
ference in dynamic performance between air consumption and 
“metabolism.” Thus in the light of the general philosophy, that 
any indicator of changes in any generalized displacement (or of a 
co-ordinate corresponding to a degree of freedom) of a complex 
mechanical system allows inspection of the dynamic performance 
of the system, the first primary data present only relative air 
consumption, to retain the highest possible degree of accuracy. 
It is proposed later to build a more complex (and expensive) re- 
breather system that may have comparable accuracy for record- 
ing over the length of test period required. 
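
The check described above, multiplying the air consumption by the drop in oxygen partial pressure between inhaled and exhaled gas after averaging over five breaths, can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the sample readings, and the units (psi/min for air consumption, atmospheres for partial pressure) are assumptions made here and are not values from the experiments.

```python
from statistics import mean


def five_breath_average(values, window=5):
    """Running average over `window` consecutive end-of-expiration readings."""
    return [mean(values[i:i + window]) for i in range(len(values) - window + 1)]


def oxygen_uptake_proxy(air_rate, p_o2_inhaled, p_o2_exhaled):
    """Air consumption times the oxygen partial-pressure drop across a breath."""
    return air_rate * (p_o2_inhaled - p_o2_exhaled)


# Hypothetical end-of-expiration oxygen partial pressures (atm), one per breath.
exhaled = [0.165, 0.160, 0.170, 0.168, 0.162, 0.166]
smoothed = five_breath_average(exhaled)

# Hypothetical air consumption (psi/min) and tank oxygen partial pressure (atm).
proxy = [oxygen_uptake_proxy(2.0, 0.21, p) for p in smoothed]
```

The proxy is proportional to oxygen uptake only to the extent that the inhaled partial pressure stays fixed, which is why the tank value is passed in as a constant.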

The data from four such tests are shown in Fig. 2. These data 
were taken at room temperatures of 12, 21.5, 26, and 35 C, respec- 
tively. In each of the curves are shown the individual 1/2-min 
averages, expressed as a decay rate in psi/minute (decay of air 
pressure in the 2-liter storage volume); and four steps of filtering, 
obtained in each case by graphically averaging the individual 
cycles obtained from the previous filtering. 

The most noticeable characteristic of the primary data 
is the rich harmonic content of the air consumption. The second 
most noticeable characteristic is the similarity in harmonic content 
in all temperature zones, whether in the “metabolic” zone, the 
vasomotor zone, or the evaporative zone. Other characteristics 
that may be noted are that the significant harmonics seem to be 
discrete in number and essentially the same in frequency and 
amplitude independent of temperature, that the initial transient 
response is approximately a damped oscillation at one of the 
salient harmonic frequencies, that these harmonics persist in- 
definitely in time (i.e., that the system is nonlinear), that the 
time for an equilibrium cycle is quite long (the data suggest that 
it is not less than three hours), and that the mean level of con- 
sumption changes significantly with temperature. The data also 
suggest that the order of magnitude of the instantaneous filtered 
average “metabolism” fluctuates by a factor of two in a “steady- 
state” situation independent of the control region. 

Air consumption derived from Fig. 1(a) (running five-breath averages) 

Fig. 2(a) Air consumption of human subject at 17 C ambient temperature (expressed as the number of psi/min of air used from a 2-liter volume) 

The basic tentative inferences that were drawn from these data 
are that the major thermoregulation in the body derives from a 
lively on-off heat engine (the muscles) which sheathes the internal 
zone of the body, that other auxiliary controllers control the 
level of internal temperatures through at least three other control 
mechanisms, and that the shortest time over which thermal 
equilibrium exists is of the order of a few hours. Thus finally for 
the first time some idea was available as to what time domain one 
must average over in order to apply equilibrium assumptions. 
As a sharper test, both of this hypothesis and of the apparent 
demonstration of dynamic cyclicality in human response, an in- 
tegrating system was constructed, in which a medium-sized supply 
tank (approximately 2/3 cu ft internal volume) was used as a sup- 
ply volume, and connected directly to a demand regulator with 
a precision 0 to 3000 psi (0.1 per cent accuracy) gage teed into the 
tank. By this means, the air consumption over a longer period 
could be averaged. In particular the average over increasing time 
in the presence of a dynamic repetitive bounded cycle should be 
a damped oscillation whose amplitude decays hyperbolically 
toward the asymptote. 
The period of time that it takes the average to level off is a meas- 
ure of the longest dynamic cycle present. 

Fig. 2(b) Air consumption at 21.5 C 

Fig. 2(c) Air consumption at 26 C 

Fig. 2(d) Air consumption at 35 C 
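
The hyperbolic decay of the running average can be illustrated numerically. For a pure sinusoidal cycle of amplitude A and period P about a fixed mean, the average taken from time zero to time t departs from the mean by at most AP/(πt), so the excursions shrink as 1/t. The sketch below is a minimal illustration under that single-sinusoid assumption; the mean level, amplitude, period, and sampling interval are hypothetical and were chosen only to resemble readings taken every 15 min for 5 1/2 hr.

```python
import math


def cumulative_average(samples):
    """Average of the first n samples, for n = 1, 2, ..., len(samples)."""
    averages, total = [], 0.0
    for n, x in enumerate(samples, start=1):
        total += x
        averages.append(total / n)
    return averages


def bounded_cycle(mean_level, amplitude, period, dt, n):
    """Samples of a sinusoidal cycle taken at the midpoint of each interval dt."""
    return [
        mean_level + amplitude * math.sin(2.0 * math.pi * (k + 0.5) * dt / period)
        for k in range(n)
    ]


# Hypothetical cycle: mean 10, amplitude 2, period 60 min, sampled every 15 min.
samples = bounded_cycle(10.0, 2.0, 60.0, 15.0, 22)
running = cumulative_average(samples)

# Departure of the running average from the true mean after each interval.
departure = [abs(a - 10.0) for a in running]
```

Plotting `departure` against elapsed time shows an oscillation under a 1/t envelope; the longer the slowest cycle present, the longer the average takes to settle.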

Data on a selected experiment performed this way at a tem- 
perature of 20.5 C are shown in Fig. 3. Pressure readings were 
taken on the tank every 15 min for 5 1/2 hr. The data in this case 
were obtained in a more normal environment. Another young 
male (about 18 years old) lightly dressed sat in a rather large air- 
conditioned conference room, in which a modest sweeping circula- 
tion of air existed but with no direct breezes. He sat in a rather 
open chair and performed the sedentary activity of reading books 
with a minimum of squirming. He was introduced into the en- 
vironment as a step function from normal activity at normal tem- 
perature (25 C). The room, and tank, had been left cooling to 
equilibrium for the previous 16 hr. 

The air consumption averaged over increasing time is shown 
in Fig. 3. Since these data include the initial transient, what is 
also shown is the average consumption for each 15 min, the ap- 
proximate (but filtered to the 15-min level) cycle (these data thus 
contain all cycles longer than 15 min), and also the increasing 
time average integrated backwards from the end of time. It is 
clear from these data that the integrated damped oscillation takes 
a period of the order of 3 hr to substantially disappear. Thus 
again, in an independent fashion (with even considerably more 
precision), the reality of dynamic cycles is demonstrated, as well 
as the reality that thermodynamic equilibrium is only re-estab- 
lished in a test or averaging period of the order of 3 hr. 
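
The three curves described for Fig. 3 (the consumption in each 15-min interval, the average over increasing time, and the same average integrated backwards from the end of the run) can all be derived from the tank pressure readings alone. The following is a minimal sketch of that reduction; the pressure values are hypothetical and the function names are introduced here for illustration only.

```python
def interval_rates(pressures_psi, dt_min):
    """Pressure decay rate (psi/min) in each interval between readings."""
    return [(a - b) / dt_min for a, b in zip(pressures_psi, pressures_psi[1:])]


def running_mean(values):
    """Mean of the first n values, for n = 1, 2, ..., len(values)."""
    means, total = [], 0.0
    for n, v in enumerate(values, start=1):
        total += v
        means.append(total / n)
    return means


def backward_running_mean(values):
    """Running mean accumulated from the end of the record toward the start."""
    return running_mean(values[::-1])


# Hypothetical tank readings (psi) taken every 15 min.
pressures = [2000.0, 1970.0, 1945.0, 1915.0, 1890.0]
rates = interval_rates(pressures, 15.0)
forward = running_mean(rates)
backward = backward_running_mean(rates)
```

Both averages necessarily end at the same value, the total pressure drop divided by the total time; the forward average carries the initial transient, while the backward average does not meet it until the end.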

It appeared at this point that a sufficient description was 
available to demonstrate the effect of clothes, having demon- 
strated a control action in the human being, a period of time over 
which equilibrium thermal exchange could be rationally invoked, 
and a rational variation with ambient temperature changes. 
[Recall that the needs for a conductance model $K(T_s - T_a)$ 
involve three elements: “constancy” of load $K$, and definiteness 
in the meaning, effect, and determination of $T_s$ and $T_a$. The 
effect of these experiments had been to place $T_s$ in a rational 
context.] That is, there should exist a sensitivity to change 
in clothes corollary to a change in temperature. Thus the next 
series of experiments was at constant temperature, with an in- 
cremental change in clothes. [Temperatures in the range 21 to 
22 C were used. This temperature was chosen to be low enough 
to lie in the metabolic range, where it could be estimated both 
from the literature and from the experiment that the subject 
would shiver.] 
Specifically, a somewhat light jacket was added to the subject’s 
clothes. An approximate estimate suggested that the incremental 
change in clothes value would be of the order of “a modest frac- 
tion” of a clo unit. Experimentally, the subject reported reasona- 
ble comfort with the jacket, but chill over a long period of time 
without it. These experiments were independently performed; 
i.e., a step function from normal activity and temperature to 
quiet reading at 21 C. The results in integrated air consumption 
were disappointing. Inadequate change in air consumption or 
metabolism was found. These experiments were repeated until 
it was quite clear that reproducible accuracies of better than 2 
per cent were being achieved but that only small changes in 
metabolism were obtained with incremental clothes change. 
Yet the subject was conscious of changes in “comfort” level. 
Thus it was clear that the subject’s sensors (or controllers) were 
either more sensitive or responded in some other fashion than 
had been assumed. 

After some confusion and experimentation, a final hypothesis 
was arrived at, which appears to forge the last link in a correct 
phenomenological description. Again, this latter hypothesis, 
while related to concepts previously discussed in the literature, 
puts them into a proper light. 

It is often stated in the literature that the extremities are the 
major sources of heat exchange. This concept could not be put 
into useful and operational meaning, except to take it at face 
value that the exposed body area in extremities is large. How- 
ever, there existed a more fruitful possibility in the previous 
analysis. 

The negative results of the incremental clothes experiments 
could not be gainsaid. The net equilibrium averaged metabolism 
of the body remained substantially constant even though clothes 
were changed. Thus either a change in clothes conductance or a 
change in skin temperature had to take place. Yet earlier ex- 
periments had indicated that there is a tendency to regulate skin 
temperature at fixed values. Thus either this rough regulating 
hypothesis was incorrect or the clothes conductance changed. 
The basic clue to resolving the dilemma lay in the nonlinearity of 
the system. The nonlinearity created the possibility of more 
than one singularity (here say a point of stable equilibrium) 
around which the system could operate; i.e., that the controls 
were not single valued. For this an additional controller degree 
of freedom had to be found. This lay in the extremities, or more 
particularly in the “valves” to the extremities. 

The common description of vasomotor control involves a control 
of radial conductance in the body through the fluid-exchange 
system (i.e., the conductance of the layer outside of the muscles 
can be changed by vasoconstriction or vasodilation. This is con- 
sidered to be the controller mode for the vasomotor zone). How- 
ever, there is also an “axial” control of conductance. While as a 
first approximation one may regard the human system as a 
sphere with three shell-like zones as representing only a modest 
deformation of its elongated form, it now becomes intrinsic that 
conductance control is not single moded, but is lumped into more 
than one element in length. This represents an independent de- 
gree of freedom. 

Fig. 3 Cumulatively integrated average air consumption of human subject at 21 C (expressed as the number of psi/min used from an approximately 2/3 cu ft volume) 

The question on how the control function involving axial con- 
ductance was called into operation was given the following in- 
terpretation: There is either no control (or actually a positive 
lock on this control) or there is an independent feedback control 
system of the engine as a whole. In a quiescent situation, either 
consciously or unconsciously imposed on the system, the body is 
ruthless. If a heat loss persists in the system, the body shuts off 
the heat exchange to the extremities and lets them hang on es- 
sentially as passive capacitances in the ambient environment 
(i.e., they assume essentially ambient temperature). The com- 
plete experimental details of this process are not clear. Pre- 
sumably ever-increasing chunks of the body are shed off into the 
ambient environment, until perhaps only the vital organs are 
regulated. (Data on this point, however, would have to be 
gathered.) This represents one branch of operation, which per- 
haps may be best described as the “survival mode of operation 
of the system.” 

In another branch of operation, the system has a more active 
mode of control. The heat engine is excited by central control 
system signals which derive from sensor signals in the extremities. 
Here the extremities are used intrinsically only as null indicators. 
If an extremity temperature begins to drop, an extremity tem- 
perature indicator sets up a feedback signal in response to which 
exercise is performed—perhaps by that extremity. The engine 
“shuffles,” mills about, slaps or rubs the extremities, or puts the 
hands in the pocket, and so on. These acts are done with no ex- 
ternal work (the point is not that they are necessarily done with 
no external work, but that they may be done with no external 
work. Masses of the body are raised and lowered with no net 
mechanical work done on the environment. The same over-all 
average heat-balance equation may apply but the values of 
various terms must change. Thus a multivalued system of equi- 
libria may exist). However, internal work is done by the muscu- 
lar heat engine which is degraded into heat. It is to be expected 
that this mode of equilibrium operation will occur with increased 
metabolism. It may be referred to as the active metabolic mode 
or comfort mode of temperature control, for in this mode the 
body (namely using the extremities as null indicators) indicates 
its own comfort. 

While this interpretation of events uses similar concepts to 
those commonly alluded to in the physiological literature, it may 
be noted that the sequence and interpretation of events is dif- 
ferent. The extremities are not the main sources of heat regula- 
tion in the body; they are null temperature indicators. A dis- 
tinction must be made between radial and axial conductances. 
A meaning can be assigned simultaneously to the statements that 
average skin temperature changes with external temperature, and 
that the skin temperature tends to be regulated. If one imagines 
a localized skin temperature which is constant for all environ- 
mental temperatures above a definite “control” temperature, and 
which then drops to ambient temperature for lower temperatures, 
one has the beginning of an idealized model. More precisely, in- 
stead of a sharp break from a level temperature, if there is a 
slightly drooping regulating curve with a break taking place at 
some point on the knee of the curve a more probable description 
obtains. Data that point to the validity of this concept may be 
found in Sheard’s article in [2]. Either due to varying break 
sensitivities of different parts of the skin, or different immediate 
environment temperatures, in the survival mode of operation, in- 
creasing areas of the skin drop more nearly to ambient tempera- 
ture. This is interpreted as a drop in mean temperature, whereas 
the more precise interpretation should or could be that regulated 
portions of the skin are still regulated, but an increased surface 
area of the body is not regulated. 
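
The idealized sharp-break model, and the way it produces an apparent drop in mean skin temperature, can be sketched as follows. This is a minimal illustration only: the regulated temperature, the control (break) temperatures, and the division of the skin into two equal patches are hypothetical values chosen for the example, not measured ones.

```python
def local_skin_temperature(t_ambient, t_control, t_regulated):
    """Sharp-break model: regulated above the control temperature, ambient below."""
    if t_ambient >= t_control:
        return t_regulated
    return t_ambient


def mean_skin_temperature(t_ambient, patches, t_regulated):
    """Area-weighted mean over patches given as (area_fraction, t_control)."""
    total_area = sum(fraction for fraction, _ in patches)
    weighted = sum(
        fraction * local_skin_temperature(t_ambient, t_control, t_regulated)
        for fraction, t_control in patches
    )
    return weighted / total_area


# Hypothetical skin: two equal patches that break at 20 C and at 10 C.
patches = [(0.5, 20.0), (0.5, 10.0)]
mean_at_25 = mean_skin_temperature(25.0, patches, 33.0)  # both patches regulated
mean_at_15 = mean_skin_temperature(15.0, patches, 33.0)  # one patch at ambient
mean_at_5 = mean_skin_temperature(5.0, patches, 33.0)    # both patches at ambient
```

In this sketch the mean falls as the ambient temperature falls even though every patch that is still regulated stays at the same temperature, which is the distinction drawn in the paragraph above.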

It would thus appear that the horns of the dilemma in the de- 
scription of the heat balance given in the literature have been 
bent into shape. The conventional description has utilized the 
unit area of body surface as a normalized variable. However, the 
“true” heat loss of an equivalent conductance term must be 
derived from 

$$\int h\,(T_s - T_a)\,dA$$

Now either this term is computed as an average over the entire 
nonre-entrant body area $A_0$ as 

$$h\,(T_s - T_a)\,A_0$$

in which $T_s$ and $h$ are both undescribed functions of the operating 
conditions (i.e., they must be independently measured); and $h$ 
may be considered to depend on the mode of operation of the 
system (for example, in the extreme, a sock is of little insulating 
value to an element which has become passive, except as con- 
tributing to a time delay); or it may be computed by 

$$h_c\,(T_{sr} - T_a)\,A$$

(assuming a sharp break to ambient temperature of skin tem- 
perature) in which $h_c$ is the effective conductance of clothes per 
unit area of active skin (which may vary with the portion of skin 
that is active); $T_{sr}$ is the average regulated skin temperature (pre- 
sumably in the “vicinity” of 35 C); and $A$ is a variable which 
is determined by the physiological mode of operation. 
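
The equivalence between the area integral and the active-area form under the sharp-break assumption can be checked with a discrete sum over skin patches. The sketch below is illustrative only: the conductance, the patch areas, and the temperatures are hypothetical, and the units (watts per square meter per degree, square meters, degrees C) are an assumption made for the example.

```python
def patch_heat_loss(patches, t_ambient):
    """Discrete form of the area integral over patches (h, t_skin, area)."""
    return sum(h * (t_skin - t_ambient) * area for h, t_skin, area in patches)


def active_area_heat_loss(h_clothes, t_regulated, t_ambient, active_area):
    """Sharp-break form: only the actively regulated area exchanges heat."""
    return h_clothes * (t_regulated - t_ambient) * active_area


# Hypothetical body: 1.2 m^2 regulated at 35 C, 0.6 m^2 passive at ambient.
t_ambient = 15.0
patches = [(5.0, 35.0, 1.2), (5.0, t_ambient, 0.6)]

integral_form = patch_heat_loss(patches, t_ambient)
sharp_break_form = active_area_heat_loss(5.0, 35.0, t_ambient, 1.2)
```

A passive patch sits at ambient temperature and so contributes nothing to the sum, whatever its clothing conductance; all of the physiological variation is then carried by the active area.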

The net practical result of this discussion is that the charac- 
terization of clothes as a conductance model depends on the mode 
of operation of the system; that it is stretched thin for applica- 
tion to the survival mode of operation in the cold (not that it 
may not be used, but that it must be used with extreme caution); 
and that it is most applicable to a comfort mode of operation in 
which the user, say, exercises to “maintain comfort,” which finally 
can be given the unique operational and nonphysiological mean- 
ing that conditions of exercise are such that the extremities as null 
sensors are held at a “constant” level. As a final practical conse- 
quence, it suggests a new or truer point of view in the design of 
clothes. 

Thus clothes can be considered from two points of view (par- 
ticularly in the cold): as “normal” comfort level elements or as 
“extreme” survival level elements. The former point of view be- 
comes the manufacturer, designer, and evaluator of “normal” 
clothes for active environments. The latter becomes those 
interested in survival activities (such as the quiescence of sleep- 
ing, or other forms of forced inactivity in extreme temperature). 
Thus it becomes clear that the prescriptions too often imposed, 
both in the experiments here initially reported and in the literature, 
in which the subject actively co-operates to remain passive, are 
artificial representations of only one extreme of operation, and as 
such are not necessarily the “normal” mode of physiological 
thermoregulation. 

This, therefore, closes as much commentary on physiological 
matters as a physicist may be entitled to. The remainder of the 
task of physiological interpretation and validation of control 
and sensing mechanism must be applied by the physiologist. 
The dynamic time scales and the relative dynamic responses to 
step functions must afford him his basic clues as to the possible 
analytic descriptions. It would appear that these intermediate 
time scales pose very specific problems of being too slow for hy- 
drodynamic fluid exchange, nervous, and equilibrium chemical 
reaction time, but too fast for thermal exchanges. This would 
appear to leave diffusion of mass products as the most promising 
element to provide appropriate time scales. An added word of 
caution lies in any attempt at a description of nonlinear proc- 
esses. The analytic solution must be relatively precise in order 
to give rise to the proper stability of the indicated limit cycles 
[3]. 

A preliminary experiment was performed to test the validity of 
this concept of operational mode of the system. First, glass ther- 
mometers were crudely inserted in the instep of a shoed and 
socked foot and in a lightly gloved hand. The subject was told 
to step up and down a small step at a faster or slower rate, or 
move the hands up and down at a faster or slower rate in re- 
sponse to commands. The commands were based on rapid in- 
termittent reading of the thermometers. With the reference 
chosen as the temperature at which the extremities were found 
under a normal sedentary activity level in the vasomotor region, 
the subject was given commands to exercise the extremities to 
hold this temperature constant (a switching band of nominally 
0.1 deg C width was originally sought for the error level, but it 
was difficult to maintain a level better than 0.5 deg C in this 
crude experiment). At the same time air consumption level was 
measured. 
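
The observer's role in this loop amounts to an on-off controller with a switching band about the reference temperature. A minimal sketch of such a rule is given below. The command names, the 32 C reference, and the thermometer readings are hypothetical; only the 0.5 deg C band width is taken from the text, and the rule itself is an assumed idealization of what the observer did, not a record of it.

```python
def exercise_command(temperature, set_point, band, current):
    """On-off rule with a switching band of total width `band` about `set_point`.

    Returns "faster" below the band, "slower" above it, and leaves the
    current command unchanged inside the band.
    """
    half_band = band / 2.0
    if temperature < set_point - half_band:
        return "faster"
    if temperature > set_point + half_band:
        return "slower"
    return current


# Hypothetical extremity readings (deg C) against a 32 C reference.
readings = [32.0, 31.7, 31.9, 32.3, 32.1]
command = "slower"
commands = []
for reading in readings:
    command = exercise_command(reading, 32.0, 0.5, command)
    commands.append(command)
```

A narrower band gives tighter temperature control at the cost of more frequent commands, which is consistent with the difficulty reported in holding the 0.1 deg C band by hand.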

For the first time, the system (i.e., the human) had the proper 
operating “feel” of a control system. It was “lively”; the feed- 
back of information (through the observer loop) was copious; 
adjustments of speed or power level were sensitive; the entire 
concept of a system that was “dragging its feet” in control dis- 
appeared; and perhaps most significantly of all, the feeling of de- 
pendence on psychological factors (the feeling of dependence on 
the width of a friction band, and the precise means by which sig- 
nal was applied) disappeared. A better description lies in the 
usual feeling when an instrument or system is tested at input 
levels too close to its ultimate sensitivity; i.e., microvolt excita- 
tion of an instrument with a microvolt noise level. As soon as the 
inputs get large enough, the “friction” level or “noise” level of 
the instrument begins to assume its proper perspective as a small 
fractional or percentage source of error. Whereas in previous 
experiments the state of mind of the subject was of considerable 
concern (was he relaxed, should he be instructed, what should he 
be told to do in all kinds of anticipated emergencies), and its 
effect incomprehensible (the similar questions in low-level excita- 
tion experiments are the thousands of meaningless questions re- 
garding the precise possible sources of noise), now it didn’t matter. 
The subject was just too busy in high-level activity to be con- 
cerned with minor disturbance sources. In other words the 
directed signaling bedlam of a feedback loop in attempting to 
maintain a set point was found. At last, for example, there was 
the “feel” of a more sensitive operational measure of a “comfort” 
level than the psychological response. From beginning drops of 
extremity temperatures of tenths of a degree, the observer could 
anticipate ultimate “discomfort” or “feeling of cold” much before 
the subject. 

Thus while for the first time the stage setting for proper testing 
of clothes, and evaluation of the human as a complete system was 
achieved, many “minor” questions now raised themselves. Ex- 
perimentally, for example, internal exercising by using the entire 
body seemed too “vigorous” (for example, the chest began to 
sweat when raising and lowering the body). Questions thus 
formed themselves as to whether the evaporative mechanism was 
a follower (say of a relaxation type), or an active controller; 
whether the “null” balance indicators of the extremities were 
lumped or distributed (that they are not localized appears ob- 
vious. However, this does not contribute information as to 
whether the null-balance system is distributed as a single equiva- 
lent lumped element, or multimoded); what the nature of the sub- 
sidiary control loops might be (that is, whether null detection in one 
site leads to engine action in a second site, involving convective 
mass transfer, or diffusive mass transfer of heat-exchange fluid, 
or whether the system is partially open in that part of the heat 
exchange is in a closed-conduit system going to one site, and part 
in an open-conduit system going to another site). Thus as a 
fielder’s choice the first simple assumption to test was that the 
auxiliary loop was a closed-conduital system of a single lumped 
element in which the auxiliary circuit elements were all localized 
in the controlled site; i.e., a lumped sensor at the extremity could 
signal to turn on a lumped engine element at the extremity to 
control a lumped localized valve that controlled the “flow” of 
heat-exchange fluid. It is obvious that this literal model is non- 
sense. However, it permits rough characterization of the process 
and some understanding of what may be expected in dynamic re- 
sponse. More important, it extends the count of the number of 
degrees of freedom to the minimal number that is required for the 
beginnings of a primitive model of the entire system. Thus now 
the body must have at least three zones as counted in the radial 
direction, and at least one equivalent independent three-zone 
side branch. 

Thus a second crude experiment was performed in which the 
internal work was confined, as far as possible, only to the extremi- 
ties. A bicycle was adapted to provide a lift treadle for the hands 
with only slight forearm motion, by “immobilizing” the muscular 
structure above the elbows by creating a fixed resting point for the 
elbows; and to provide a treadle for the feet, in which the knee 
was essentially fixed in space, and, in the main, foot lift only used. 
Loading of a nondissipative sort was provided through “friction- 
less” pulleys attached to dead weights, in which change in 
weight was always made at a fixed height. Thus the excitation 
of these weights (i.e., raising and lowering them) satisfied the 
conditions of reversible quasi-static equilibrium in which no ex- 
ternal work is done. Null measuring stations to provide error 
signals for the feedback loop were taken at the toe and finger. 
An additional monitoring station was taken on the chest. How- 
ever, the controls were only for the hands and feet. A sensitive 
potentiometer and thermocouple were used to measure hand and 
foot temperatures. A schedule of measurement and weight ad- 
justment of the order of 5 sec was chosen and attempts were 
made to maintain this schedule with only modest success. (As 
a rough summary it was found that the “valves” appear to have 
considerable dead time so that quite rapid control is needed to 
hang onto the set point.) However, the over-all experiment 
appeared to work beautifully. As in the previous crude experi- 
ment, a step-function change in clothes level was chosen for a 
signal change while at constant temperature. The previous 
experiment had already indicated the approximate transient 
time to allow before counting the dynamic equilibrium. In both 
experiments, the immediate and strikingly notable difference 
could be discerned that the load against which internal work must 
be done to maintain null balance in the extremities shifts largely, 
as was expected, with and without a jacket (i.e., the thermal load 
could be “seen” in the weight in the loading pans), and finally, 
in the latter experiment at last some notable changes in meta- 
bolic level (or air-consumption level) occurred. Some preliminary 
data of this type are reported in Fig. 4. The data were taken by 
the same system used for the data in Fig. 2. Operating condi- 
tions were about 15 deg C. A lightly clothed young man was 
subjected to a step function in temperature of 15 C. After suf- 
ficient time had elapsed that it was reasonably certain that initial 
transients had decayed (approximately 2/3 hr), he was then sub- 
jected to a step function of putting on a light jacket. The ac- 
tivity level to retain “comfort” dropped very quickly. However, 
it may be noted in Fig. 4 that the decay transient in air con- 
sumption (or metabolism) remained about the same, and about 
the same as in the passive mode of operation shown in Fig. 2. 
(A damped “oscillation” with a period in the 1000-sec range.) It 
is thus quite reasonable that it is this periodic decay and the 
periodic sustained oscillatory cycle of this frequency that is most 
likely the main controller of thermoregulation in the body. 

It may also be noted that air consumption (or metabolic 
change) is considerable. A second exploratory change of taking 
the pants down to half mast didn’t appear to make much change. 
However, the subsequent removal of the jacket showed a transient 
that seemed to recover the earlier air consumption. Thus even 
though the data at this time are limited, evidence for an operating 
mode of the human that tends to give equilibrium cycles and thus 
equilibrium states with regard to a set of clothes appears to have 
been achieved. 

Fig. 4 Air consumption of a human subject at 15 C while engaged in just sufficient recoverable work with the hands and feet to keep those extremities at a “comfortable” temperature of 32 C 

It is the intent at this time to construct an adequate automatic 
feedback apparatus and unquestionable “metabolic” apparatus 
for further exploration. 

Thus in conclusion, it is believed that a basic model has been 
developed for measuring clothes. It may provisionally be con- 
sidered a conductance model. It is based on the equation 

$$K = \frac{M - L}{T_s - T_a}$$

Here $M$ is the average metabolism obtained from an integrating 
period of the order of 3 hr; $L$ is the loss of power in stack products 
which do not pass through the clothes (i.e., breathing, face, other 
exposed parts); $T_s$ is to be operationally defined through the 
mode of operation of the system. It would appear to be a most 
meaningful quantity for active “comfort” operation. However, 
in any case, it would have to be averaged over a period of time for 
equilibrium steady-state cycles to occur. It is the latter difficulty 
which probably would most quickly lend conviction to the need 
of an impedance law; i.e., resistance plus reactance, or conductance 
plus susceptance. $T_a$ is the equivalent temperature of the ex- 
ternal environment; i.e., it is to be derived in a manner similar to, 
but perhaps an extension of, Gagge’s concept of a mean operative 
temperature. $K$ is a conductance which now is no longer neces- 
sarily constant or even single valued but at least describable in 
an averaging sense. It is clear now that a priori models for $K$ 
must be constructed in a dynamic fashion, that is, must be con- 
structed from dynamic treatment of a heat-transfer problem in 
which the dynamic laws of radiation, conduction, convection, 
storage capacitance, diffusion (say, at least of water vapor), and 
evaporation are treated. Thus in principle, the conductance of 
clothes must be computed from a complete hydrodynamic field 
equation set. 
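
As a worked illustration of the averaging sense in which the conductance is to be understood, the sketch below computes it from a time-averaged metabolism. It assumes the relation is read as the metabolism, less the losses that bypass the clothes, divided by the difference between the regulated skin temperature and the equivalent environmental temperature. All numerical values and units (watts, degrees C) are hypothetical, and the function name is introduced here for illustration.

```python
from statistics import mean


def clothing_conductance(metabolism_samples, uncovered_loss, t_skin, t_environment):
    """Average conductance from time-averaged metabolism (assumed reading).

    metabolism_samples should span an equilibrium averaging period, which
    the text puts at the order of 3 hr.
    """
    delta_t = t_skin - t_environment
    if delta_t == 0:
        raise ValueError("skin and environment temperatures must differ")
    return (mean(metabolism_samples) - uncovered_loss) / delta_t


# Hypothetical values: metabolism samples (W), uncovered losses (W), temperatures (C).
k = clothing_conductance([100.0, 120.0, 110.0], 20.0, 33.0, 21.0)
```

Because the metabolism samples oscillate, a conductance computed from any single sample would oscillate with them; only the value computed from the long-period mean is meaningful in the sense used here.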

Underlying physiological assumptions are contained not in this 
equation but in the dynamics of the fundamental equation from 
which this “result” may be considered derived. The instantaneous 
variations in an instantaneous dynamic equation reflect the 
specific physiological mechanisms that are operative. However, 
the average in time, over equilibrium time, results in this equa- 
tion. 

The broad loose physiological mechanisms that have been 
alluded to are that, in the cold, either a zone of “metabolic control” 
is achievable at constant comfort levels, or a zone of “axial con- 
ductance” control is achievable at constant average metabolic 
level; that these two cases are extremes; that all zones involve an 
active metabolic controller in the form of a heat engine which is 
“constantly oscillatory”; that the vasomotor region (if it exists 
as a true region) is most possibly properly described as a “radial 
conductance” control zone, in which the average metabolism is 
stationary over a finite-temperature range; but that it is not yet 
completely understood how the higher temperature controller 
works. This area of uncertainty could be resolved possibly 
either by dynamic over-all weight measurements (the required 
sensitivity and frequency response pose some difficulties), or 
microscopically on the basis of “valve mechanisms,” by which 
water is made available at the skin surface dynamically. The im- 
portant clue left for the physiologist is the rough order of mag- 
nitude of time scale at which various dynamic phenomena occur. 
The basic implication in this report is that an end must be put to 
static physiological models in studying human phenomena. The 
mechanisms must be described dynamically! 


Summary 


A rational basis for a conductance model of clothing has been 
established. It derives from a phenomenological model of the 
physiological man as a thermoregulating system. The model 
of the system involves a liquid heat exchanger that permits 
radial and axial exchange of heat by forced convection or diffusion 
both in an inward and outward direction. This distributed heat 
exchanger, which may be regarded as a generalized hydrodynamic 
field, however, is partitioned by a sheath-like heat engine (the 
muscles) which thus establishes three thermal zones; an internal 
well-regulated temperature zone, a transition zone in which heat is 
actively produced as a regulating function, and an exterior zone 
in which temperature is approximately regulated. The general- 
ized hydrodynamic field, partially by virtue of the fact that an 
elongated body is being dealt with rather than a compact spheri- 
cal body, is further partitioned by a distributed valve system to 
control the flux of heat-exchange fluid to the extremities. The 
heat-exchange system is not a completely closed conduital sys- 
tem, so that some heat-exchange fluid is available to the skin of 
the system through control valves for diffusive-evaporative heat 
flux. Thus the number of degrees of freedom of the system and 
the potential number of controllers include heat of combustion, 
radial conductance, axial conductance, mass flux of heat-exchange 
fluid to the skin of the system, storage (as measured by tem- 
perature) of thermal flux into the inner, outer, and peripheral 
zones. It is not completely clear how many sensing elements
are used to generate error signals for feedback control. However,
one novel point of view, with regard to sensing elements that
have been suggested in the past, is that skin temperature is a
slowly varying function of ambient temperature (at least in
restricted extremity portions of the skin), with a relatively
sharp break to ambient temperature at some critical ambient
temperature. The break, or possibly the slope of this curve at the
break, is used as a thermal sensing element for the control of
axial conductance. A second novel point of view is that extremity
temperature is used as a null indicator of “comfort.” While one
feedback control loop involves shutting off of
axial-fluid exchange, a second loop involves the system develop- 
ing an activity level in which internal work is immediately de- 
graded into heat in the muscle heat engine to maintain extremity 
temperature. The postulated existence of more than one operat- 
ing mode of the system implies nonlinearity in controller and 
error-function detectors. This is borne out by the active con- 
tinued dynamic response of the system at all times. The charac- 
teristic nature of this dynamic cyclic behavior has been demon- 
strated experimentally and shown to persist in all so-called zones 
of thermoregulation. Thus some significant modifications are 
necessary for the usual description of these zones. The harmonic 
content of the dynamic characteristics of the system is rich, but 
denumerable. In fact, approximately four major cycles at fre- 
quencies lower than the breathing frequency have been demon- 
strated experimentally, and thus shown to contribute to the 
thermoregulation of the system. It would appear that thermodynamic
equilibrium (i.e., a true steady-state cycle) is established
in the order of 3 1/2 hr. However, there are significant
thermoregulating cycles down to the 100-sec level.

The integration of dynamic performance over an equilibrium
time domain (i.e., 3 1/2 hr) permits the writing of an “equivalent
static equilibrium” description of a heat balance in the form
$$\frac{\text{Net rate of heat production}}{\text{Temperature difference}} = \text{conductance}$$

or, mathematically

$$\frac{M - L}{T_s - T_e} = K$$

where

M = rate of average heat production over the averaging
period (i.e., the metabolism, assessable by oxygen
and carbon dioxide consumption and production)

L = rate of average flux of energy that does not pass through
the clothes (i.e., heat losses from the lungs)

$T_e$ = an effective environmental operative temperature
(this, in view of the dynamics suggested, requires an
extension of the Gagge definition)

$T_s$ = mean skin temperature averaged over time for a
particular mode of operation of the system (most
comprehensibly, this mode should be a mode in which the


extremity temperatures. Whether the pure conductance-model
description can be retained for other
modes of operation of the system is moot and should
be the subject of further experimental investigation)

K = effective conductance of a set of clothes measured
under the operational conditions specified. It may
be neither constant nor single valued


The method by which the conductance of a set of clothes might
be computed theoretically begins to emerge conceptually. One
must set up and solve a complete set of dynamic hydrodynamic
field equations (suitably extended) subject to the dynamic
boundary conditions determined by the human system, and then
integrate in time to produce the time-averaged conductance.
Since an accurate description is not yet possible for the dynamic
boundary conditions that obtain at the surface of the system, such
computations are still moot. Either experimental data on the
conductance of simple-clothes systems are required, or a systematic
analytic theory and experimental data on the dynamic
thermoregulation of the human system are required to furnish
data to build up and test clothing conductances.

Thus what is primarily needed is a dynamic analytic theory of 
the thermoregulation of the human system from the physiologist, 
experimental data on clothes conductances from the clothing 
technologist, and a theory of clothes conductance from the physi- 
cist. 
Some General Considerations
in Process Optimization 


Problems of process optimization and in particular automatic
optimization are currently being approached from many different
directions. Investigations in process control and instrumentation,
studies in statistical methods for the design of experiments
for obtaining optima, studies on noise spectra and on numerical
maximization subject to restraints as represented by linear and
nonlinear programming, all throw light on the directions in which
we must advance.


What Should Be Optimized


In any consideration of optimization, one of the first
things which must be decided is what should be optimized. Very
careful consideration of this question is needed in each instance,
because if we are misled into maximizing the wrong criterion our
efforts may lead to waste rather than to savings. In chemical
problems there are usually a number of responses such as rate of
production, yield, and purity of the various products, which are
influenced by the level of controllable variables such as
temperature, pressure, concentration, and contact time. It can be
argued that it should be possible to obtain some single function of
these responses, for example some criterion of cash return or
profitability, which alone measures the desirability of a given
setting of the variables. In practice it is not always easy to
obtain such a function. In the case of the manufacture of a drug,
for example, what monetary value could be placed on impurities
which might make this drug dangerous?

In practice the objective is often best expressed in terms of
optimizing some principal response, usually related to cost,
subject to certain restrictions. These restrictions may limit the
operating variables directly; for example, certain pressures and
temperatures, or combinations of pressure and temperature, may
be unobtainable or dangerous. Alternatively, they may limit an
auxiliary response, for example purity, which is determined
simultaneously with the principal response and so limits the
operating variables at second hand via the process function.

The situation is illustrated in Fig. 1 where the principal
response is cost of product per unit weight and an auxiliary
response is impurity of the product in some appropriate unit.
Contours of cost $\eta_1$ and of impurity $\eta_2$ are shown in the
space of the variables $\xi_1$ = pressure and $\xi_2$ = temperature.
If the object was to minimize the cost $\eta_1$ subject to the
restriction that the pressure should not be greater than some
maximum operating value of $\xi_1$ and impurity level not greater
than $\eta_2 = 0.2$, the problem would be to find that point
of minimum cost within the restricted area denoted by the shading
in Fig. 1. This is in fact the point P.
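
The search for a point such as P can be illustrated numerically. The following is only a minimal sketch: the cost and impurity functions, their coefficients, the grid, and the pressure limit of 70 are all hypothetical stand-ins chosen for illustration and are not taken from Fig. 1.

```python
# Hypothetical process functions; none of these coefficients come from the paper.
def cost(pressure, temperature):
    """Assumed principal response (cost per unit weight)."""
    return 10.0 + 0.02 * (pressure - 80.0) ** 2 + 0.01 * (temperature - 150.0) ** 2


def impurity(pressure, temperature):
    """Assumed auxiliary response (impurity, arbitrary units)."""
    return 0.1 + 0.0045 * (temperature - 120.0)


def constrained_minimum(max_pressure, max_impurity):
    """Grid-search the point of least cost inside the permitted region."""
    best = None
    for p in range(40, 101):           # pressure grid, assumed units
        for t in range(100, 181):      # temperature grid, assumed units
            if p > max_pressure or impurity(p, t) > max_impurity:
                continue               # outside the permitted (shaded) region
            c = cost(p, t)
            if best is None or c < best[0]:
                best = (c, p, t)
    return best


best_cost, best_p, best_t = constrained_minimum(max_pressure=70, max_impurity=0.2)
print(best_cost, best_p, best_t)
```

With these assumed functions the unrestricted minimum lies outside the permitted region, so the search ends on the boundary where both restraints are active, which is the kind of situation described for the point P.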

This can be expressed by saying that we wish to minimize a
process function in the principal response

$$\eta_1 = f_1(\xi) \tag{1}$$

subject to restraints on the auxiliary responses of the form

$$\eta_{ia} < \eta_i = f_i(\xi) < \eta_{ib} \tag{2}$$

and to direct restraints on the variables

$$\phi_{ja} < \phi_j(\xi) < \phi_{jb} \tag{3}$$

where $\phi_j(\xi)$ is a function of the variables $\xi$. The
relationship of this problem to that of linear programming is
briefly discussed in


4 Numbers in brackets designate References at end of paper. 


Fig. 1 Contours of process functions of cost and impurity, showing
restraints (axes: pressure and temperature; the maximum operating
pressure is marked)
Methods for Finding Out What Optima Are Like


It is clear that in principle, if the functional relationships
$\eta_i = f_i(\xi)$ connecting the responses and process variables are known,
then consideration of the problem of obtaining optimum condi- 
tions can begin. The solution of problems of optimization is 
greatly influenced by the nature of these functions. In chemical 
examples the principal response is often either yield itself or some 
quantity such as cost or profitability closely related to yield. 

Over the past nine years, many studies [2-11] have been conducted
with the specific object of determining the nature of response
functions of this kind in the neighborhood of maxima or
minima. These studies have been of two kinds: (a) Those having 
a purely empirical basis, (b) those having a theoretical or pseudo- 
theoretical basis. 


Empirical Method of Exploring Optimal Regions 


In [2], [3], and [9] methods are described for approximately 
locating optimal regions. These references also describe proce- 
dures for exploring the regions after they are approximately 
located. This exploration is done initially by sets of experiments 
arranged in special patterns. The experiments illustrated in Fig. 
2 are from an investigation described in [3], [5], and [6] in which 
the yield of a particular product was studied at various conditions 
of the variables: 


$\xi_1$ = T = temperature in deg C
$\xi_2$ = c = relative concentration of reactant, per cent
$\xi_3$ = t = time of reaction in hours


These experiments were performed after an optimal region had
been approximately located by the method of steepest ascent.
Some general idea of the nature of this region was then obtained
by fitting to the observed values a second-degree equation in the
standardized variables

$$
\begin{aligned}
x_1 &= (T - 167)/5 \\
x_2 &= (c - 27.5)/2.5 \\
x_3 &= (t - 6.5)/1.5
\end{aligned} \tag{4}
$$

The second-degree equation of best fit obtained by the application
of the method of least squares was

$$
\begin{aligned}
y = {} & 58.78 + 1.90x_1 + 0.97x_2 + 1.06x_3 - 1.88x_1^2 - 1.69x_2^2 \\
& - 0.95x_3^2 - 2.71x_1x_2 - 2.17x_1x_3 - 1.24x_2x_3
\end{aligned} \tag{5}
$$

Fig. 2 An experimental arrangement in three variables to explore an
optimal region


Figs. 3 and 4 show contours of some of the systems, in two and
in three dimensions, which the second-degree equation can
represent. It will be seen that this form of equation may be used
roughly to delineate various types of optimal regions.
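
As a small worked illustration, Equations (4) and (5) can be evaluated directly for any operating condition. The sketch below takes the coefficients as they can be read from Equation (5) together with the derivatives in Equation (9); the function names are ours and are introduced only for illustration.

```python
def standardize(T, c, t):
    """Equation (4): code temperature, concentration, and time about the design center."""
    return (T - 167.0) / 5.0, (c - 27.5) / 2.5, (t - 6.5) / 1.5


def fitted_yield(T, c, t):
    """Equation (5), with coefficients as read from Equations (5) and (9)."""
    x1, x2, x3 = standardize(T, c, t)
    return (58.78 + 1.90 * x1 + 0.97 * x2 + 1.06 * x3
            - 1.88 * x1 ** 2 - 1.69 * x2 ** 2 - 0.95 * x3 ** 2
            - 2.71 * x1 * x2 - 2.17 * x1 * x3 - 1.24 * x2 * x3)


# At the center of the design all standardized variables are zero,
# so the fitted yield reduces to the constant term.
print(fitted_yield(167.0, 27.5, 6.5))
```

Such a fitted equation is, of course, only to be trusted within the immediate region covered by the experiments.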

Numerous examples studied through the years have shown that 
in practice ridge systems, that is, systems with contours like those 
shown in Figs. 3(b) and 3(c) and in 4(b), 4(c), 4(d), and 4(e), are 
most common. When this work was begun we were somewhat 
surprised at this for we had been thinking of multidimensional 
maxima in which the response fell off fairly evenly in all direc- 
tions. 


Fig. 3 Contours on surfaces generated by second degree equation in two
variables


Fig. 4 Contours of surface generated by second degree equation in three
variables
Canonical Analysis


It is well known that the nature of a second-degree equation like
Equation (5) can be readily elucidated by referring the variables
to the center of the system* as origin, and rotating the axes so
that the product terms vanish. This process is referred to as
canonical analysis. The equation is then in the form

$$y = y_0 + B_{11}X_1^2 + B_{22}X_2^2 + B_{33}X_3^2 \tag{6}$$

where $y_0$ is the value of y at the stationary point and $X_1$, $X_2$, and
$X_3$ are new orthogonal axes, the coefficients $B_{11}$, $B_{22}$, and $B_{33}$
measuring quadratic curvature along these axes.

* In some cases the center of the system will be remote from the
immediate region studied; ways of dealing with these cases are
discussed in [3], [5], and [6].

Fig. 5 Appearance of contour system showing plane of maximums

Fig. 6 Appearance of contour system showing line of maximums

The existence of ridges is indicated when one or more of the
coefficients $B_{ii}$ is small, or not significantly different from zero.
In the present example it has been shown [6] and [17] that the
data are adequately accounted for in the immediate region studied
(although, of course, not necessarily elsewhere) by a model in which
both $B_{22}$ and $B_{33}$ are zero; in fact, the model

$$y = 60.0 - 3.8X_1^2 \tag{7}$$

where

$$X_1 = 0.15T + 0.19c + 0.30t - 31.97 \tag{8}$$


Contours of this system are plotted in Fig. 5. 

It will be seen that the situation is that, locally at least, there
is not a point maximum but a surface of maxima. To put it
another way we could say there are two degrees of freedom in
the maximum. The maximal surface is approximated by the plane
defined by setting $X_1 = 0$ in Equation (8). The yield takes its
maximal value of 60.0 for any set of values of T, c, and t in the
range studied which makes $X_1 = 0$. The coefficients in Equation (8)
thus indicate the adjustments (necessary to maintain the yield at
its maximal level) to be made in any one of the variables in order
to compensate for changes in others.
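
The compensation implied by Equation (8) can be made concrete with a short calculation. This is a sketch only: the function names are ours, and the two operating conditions are arbitrary values chosen inside the range studied.

```python
def canonical_x1(T, c, t):
    """Equation (8): the one canonical variable that affects yield locally."""
    return 0.15 * T + 0.19 * c + 0.30 * t - 31.97


def ridge_yield(T, c, t):
    """Equation (7): yield depends on the conditions only through X1."""
    return 60.0 - 3.8 * canonical_x1(T, c, t) ** 2


def compensating_time(T, c):
    """Reaction time (hours) that restores X1 = 0 for a given T and c."""
    return (31.97 - 0.15 * T - 0.19 * c) / 0.30


t_a = compensating_time(167.0, 27.5)
t_b = compensating_time(168.0, 27.5)
print(t_a, ridge_yield(167.0, 27.5, t_a))
print(t_b, ridge_yield(168.0, 27.5, t_b))
```

On this model, raising the temperature by one degree at fixed concentration is offset by shortening the reaction time by 0.15/0.30 = 0.5 hr, the yield remaining at its maximal level.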

Fig. 6, taken from a study by Franklin, Pinchbeck, and Popper 
[7] and Pinchbeck [8] on the optimization of yield of phthalic
anhydride from oxidation of naphthalene, shows another com- 
mon situation. Here the surface is roughly approximated by a 
second-degree equation in which there is a line of maxima, that is, 
one degree of freedom in the maximum. 

Both of these cases mentioned so far involve stationary ridges, 
that is to say, systems for which the value of the maximum re- 
mains constant along the crest of the ridge. Other commonly 
occurring situations involve rising ridges which in two and three 
dimensions are approximated by the systems of Figs. 3(c) and 
4(d) and 4(e). These too can be elucidated by the approach re- 
ferred to in [3], [5], [6]. It is of course usually the case for a 
rising ridge that some practical limitation, such as a maximum 
working pressure or temperature, ultimately restricts the system 
so that the practical maximum consists of an optimal point just 
below the imposed barrier. 


Theoretical Approach 


Analysis of commonly occurring chemical systems shows that 
the ridges found empirically are to be expected on theoretical 
grounds. For example, the system studied in the section on the 
empirical method of exploring optimal regions, and approximated
by Fig. 5, can be most simply explained by a sequential reaction 


where the intermediate product C is the quantity whose yield is 
to be studied. In [6], [10], and [17] a theoretical function relat- 
ing yield of C to temperature, concentration, and time of reaction 
is developed based on simple kinetics. When, as an alternative 
to the purely empirical form, this function is fitted one obtains 
the surface shown in Fig. 7. 

The remarkable similarity between the characteristics of the 
empirical and theoretical surfaces will be noted. In particular 
it will be seen that the chief characteristic of the theoretical func- 
tion is that there is a whole surface on which the yield is maximal. 

A similar study has been made by Pinchbeck [8] for the example
in the section on canonical analysis. For his example, a kinetic
hypothesis, that the decomposition of naphthalene is half-order,
yields a surface whose main characteristic is a line ridge similar
in appearance to the empirical system plotted in Fig. 6.

Fig. 7 Contours of theoretical yield surface


Importance of the Nature of the Response Function 


A knowledge of the nature of the response function in the op- 
timal region is of importance because in practice optima must be 
found and maintained by some process analogous to one or other 
method for the iterative solution of equations [11]. For ex- 
ample, the quadratic equation (5) has its stationary value at a 
point which is the solution of the linear equations obtained by 
differentiating this expression 


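$$
\begin{aligned}
\frac{\partial y}{\partial x_1} &= 1.90 - 3.76x_1 - 2.71x_2 - 2.17x_3 = 0 \\
\frac{\partial y}{\partial x_2} &= 0.97 - 2.71x_1 - 3.38x_2 - 1.24x_3 = 0 \\
\frac{\partial y}{\partial x_3} &= 1.06 - 2.17x_1 - 1.24x_2 - 1.90x_3 = 0
\end{aligned} \tag{9}
$$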

These equations are very poorly conditioned [12], for to a first 
approximation they have not a point solution but a whole plane 
of solutions. The matrix of the coefficients of the $x$'s approaches
a matrix of rank 1, that is two less than the full rank 3. This loss 
of rank corresponds to the existence of two degrees of freedom in 
the maximum. The corresponding matrix in the Pinchbeck ex- 
ample approaches rank 2, one less than the full rank 3 correspond- 
ing to one degree of freedom in the maximum. 
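
One way to examine the conditioning described above is to compute the eigenvalues of the coefficient matrix of Equations (9). The sketch below uses the standard closed-form trigonometric solution for a real symmetric 3 x 3 matrix, which is our choice of method and not a procedure given in the references; the coefficients are taken as printed in Equations (9).

```python
import math


def symmetric_3x3_eigenvalues(a):
    """Eigenvalues of a real symmetric 3x3 matrix, smallest first.

    Closed-form trigonometric method; assumes the matrix is not a
    multiple of the identity (otherwise p below would be zero).
    """
    p1 = a[0][1] ** 2 + a[0][2] ** 2 + a[1][2] ** 2
    q = (a[0][0] + a[1][1] + a[2][2]) / 3.0
    p2 = sum((a[i][i] - q) ** 2 for i in range(3)) + 2.0 * p1
    p = math.sqrt(p2 / 6.0)
    # b = (a - q*I) / p is scaled so that its eigenvalues are 2*cos(angle)
    b = [[(a[i][j] - (q if i == j else 0.0)) / p for j in range(3)] for i in range(3)]
    det_b = (b[0][0] * (b[1][1] * b[2][2] - b[1][2] * b[2][1])
             - b[0][1] * (b[1][0] * b[2][2] - b[1][2] * b[2][0])
             + b[0][2] * (b[1][0] * b[2][1] - b[1][1] * b[2][0]))
    r = max(-1.0, min(1.0, det_b / 2.0))   # guard against rounding outside [-1, 1]
    phi = math.acos(r) / 3.0
    largest = q + 2.0 * p * math.cos(phi)
    smallest = q + 2.0 * p * math.cos(phi + 2.0 * math.pi / 3.0)
    middle = 3.0 * q - largest - smallest
    return smallest, middle, largest


# Coefficients of x1, x2, x3 in Equations (9), as printed
coefficients = [
    [-3.76, -2.71, -2.17],
    [-2.71, -3.38, -1.24],
    [-2.17, -1.24, -1.90],
]
print(symmetric_3x3_eigenvalues(coefficients))
```

With the coefficients as printed, one eigenvalue is several times larger in magnitude than the other two, which is the numerical counterpart of the statement that the matrix approaches one of lower rank. A small eigenvalue corresponds to a direction in which the stationary point is only weakly determined.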

As is well known, when singular situations are approached, 
iterative methods of finding solutions can become extremely 
unstable. Stability may be induced, however, by a suitable trans- 
formation of the variables. For example, it is clear that if it were 
exactly true that the maximum contained two degrees of free- 
dom, stability would be induced by controlling and optimizing 
only one variable rather than by attempting to deal with all 
three. In practice the true situation will not be represented 
exactly, but only approximated by these systems in which one or 
more functions of the operating variables are totally redundant. 
Suitable control can be introduced by using procedures which
make small changes in the important canonical variables such as
$X_1$ in Equation (6) and very large changes in the nearly redundant
canonical variables or in suitable functions of them.

Fig. 8 Evolutionary Operation: Appearance of information board after
16 cycles


Evolutionary Operation 


During the last few years a system of continuous optimization 
called Evolutionary Operation (EVOP) [13] has been in use in a 
number of chemical plants both in England and, more recently 
[14], in America. In this method the plant is not run at a fixed
set of operating conditions; instead, in each successive running 
period systematic small modifications are made following a care- 
fully planned and continuously repeated cycle. The changes in 
the process conditions are too small to produce effects on the 
product which can be detected in individual runs, but as the re- 
sults are averaged, evidence on the effects of the changes is 
gradually built up. It is then possible to see in what way the 
process recipe and mode of operation should be modified in order 
to obtain greater efficiency. This improvement is incorporated 
by making a modification in the indicated direction and a new 
cycle is then commenced about these newly found conditions. 

Feedback is provided by continuously recording results on an 
Information Board kept in the plant superintendent’s office, which 
also shows the requirements it is designed to satisfy. The plant 
superintendent decides at any particular stage whether: (a) To 
wait for further information, or (b) to modify operation. 

The result at any particular stage of one such scheme discussed 
in detail in Ref. [13] (see also [16]) is shown in Fig. 8, which 
shows some of the material recorded on the Information Board. 
The situation is shown after 16 cycles of operation for a scheme 
in which the slightly modified conditions denoted in the diagram 
by the points labelled 2, 3, 4, and 5 were run in sequence with 
the standard process (1). The averaged values after 16 cycles for 
cost, impurity, and fluidity are shown together with the 95 per 
cent error limits. At this point, using the indications from the 
data, a change was made to a lower concentration level. At this 
concentration, lower costs were obtained without harmful effects 
on impurity and fluidity. The evolutionary process was then re- 
started at the new conditions. 
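
The kind of running summary posted on the Information Board can be sketched as follows. The cost figures are invented for illustration, and the error limits use a simple two-standard-error rule that assumes independent errors; the actual calculations used in the scheme of [13] may differ.

```python
import math
import statistics


def running_summary(values):
    """Average of the results so far and an approximate 95 per cent error limit."""
    n = len(values)
    mean = statistics.mean(values)
    # Assumption: independent errors and a two-standard-error multiplier
    limit = 2.0 * statistics.stdev(values) / math.sqrt(n)
    return mean, limit


# Hypothetical cost results, one per cycle, for the standard process (1)
# and for one of the slightly modified conditions (2)
board = {
    "1 (standard)": [32.6, 33.1, 32.4, 32.9, 33.0, 32.7, 32.8, 32.5],
    "2 (variant)": [32.1, 32.5, 31.9, 32.4, 32.2, 32.0, 32.3, 32.2],
}
for condition, costs in board.items():
    mean, limit = running_summary(costs)
    print(f"{condition}: {mean:.2f} +/- {limit:.2f}")
```

As further cycles are added the limits narrow roughly in proportion to the square root of the number of cycles, which is how an effect too small to see in an individual run gradually becomes detectable.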

The schemes are run by plant personnel themselves under the 
supervision of the plant superintendent and experience shows that 
this can be done at little extra expense and provides added in- 
terest and incentive. Production is thus used not only to gen- 
erate product but also to generate information on how to improve 
the product. Using this information, the process recipe and hence 
the routine of plant operation is slowly but inevitably nudged into 
that form ideally suited to the particular plant which is being 
operated. 
Fig. 9 Diagrammatic representation of an Evolutionary Operation
scheme showing feedback via closed loop 


Operation can be modified in a variety of ways depending on 
circumstances. Some of the alternatives open are: (a) To adopt 
one of the variants as the new “standard process” and to recommence
the cycle about this point; (b) to explore an indicated
favorable direction of advance and recommence the cycle around 
the best conditions found; (c) to change the pattern of variants 
to one in which the levels are more widely spaced or to modify the 
pattern of variants in any other way which seems relevant; (d) 
to substitute new variables for one or more of the old variables. 

An Evolutionary Operation committee meets periodically to 
discuss the results and proposes new variables to be included in 
later phases of operation. Fig. 9 shows the organization of the 
scheme thus developed. 

One of the strengths of Evolutionary Operation is its great 
flexibility. One month the Evolutionary Operation scheme may 
be used on one set of variables and another month perhaps on 
some completely different set, and many of the factors studied 
will be of the static kind for which choices can be made once and 
for all. In these changing circumstances it is most necessary to 
keep the plant superintendent himself in the control loop. It
would certainly not be desirable to attempt to introduce an auto- 
matic element into such a scheme. On the other hand, wherever 
there exists some dynamic factor such as aging of catalyst or 
changing quality of feedstock whose effect on the location of the 
optimum cannot be confidently predicted but must be con- 
tinually adjusted for, the introduction of an automatic element is 
highly desirable. 

Many of the principles which must be considered in the design 
of the manual schemes of Evolutionary Operation are equally im- 
portant in consideration of schemes of automatic procedures of 
optimization. We shall discuss these now considering first the 
application of EVOP to a single variable in a batch process. 


Simple Design Considerations for Evolutionary Operation on 
a Batch Process 


Consider a batch process in which a single variable, temperature,
is, as a standard routine, varied by a small amount about
the normal operating level. Suppose the normal operating
temperature is 160 C and in the Evolutionary Operation scheme n
successive batches are made at 162 C followed by n successive
batches made at 158 C. Fig. 10(a) illustrates such a scheme with
n = 3. Fig. 10(b) shows the steady-state process function for
the yield of the system, that is the value $\eta$ of the “true” yield at
each value of the temperature $\xi$. In the example shown there is
a difference, $\eta(162\ \text{deg}) - \eta(158\ \text{deg})$, in the true yield at the
temperatures 162 and 158 C. In practice we cannot know the
true value $\eta(\xi)$ of the yield at any level of $\xi$ but only the
observed value,

$$y = \eta(\xi) + \varepsilon$$

where $\varepsilon$ measures the error or disturbance. In many chemical
processes this error may be ±5 per cent or even greater. The
effect of this error or disturbance may be reduced by continuous
repetition and averaging, so that if $\bar{y}(\xi)$ represents the observed
average of a series of batches run at temperature $\xi$, we can
actually calculate

$$\bar{y}(162\ \text{deg}) - \bar{y}(158\ \text{deg}) = \eta(162\ \text{deg}) - \eta(158\ \text{deg}) + \Delta\bar{\varepsilon}$$

and $\Delta\bar{\varepsilon}$, the difference in averaged errors, will after sufficient
repetition become small compared to the difference of interest,
$\eta(162\ \text{deg}) - \eta(158\ \text{deg})$.

Fig. 10 Some diagrams of importance for Evolutionary Operation
scheme on a batch process

Now it is of great importance that the scheme be run so that, in
the face of the inevitable random variation, the change in yield
produced by temperature change can be detected as quickly as
possible. That is, for a given amount of repetition $\Delta\bar{\varepsilon}$ should be
as small as possible. A question we need to answer is “should
we run a number of batches at the level 162 C before changing the
level to 158 C or should we make more frequent changes,
possibly after every batch?”

In order to study this question, let us suppose that N = 2nk
is the total number of batches to be run. For a fixed N, we can
now consider the effect of changing n, the number of batches we
run before we make a change of conditions, where k is the number
of repetitions of the high-level and low-level runs. We can do this
by choosing n so as to minimize the variance (mean squared
error) $V(\Delta\bar{\varepsilon})$ of $\Delta\bar{\varepsilon}$. It is easy to show that the optimal choice of
n depends entirely on the nature of the disturbance $\varepsilon$. When
(as is rarely the case) there are no correlations between errors in
successive observations, the value of n is immaterial, the
accuracy obtained from a total of N batches being the same for all
possible arrangements. When correlations between successive
observations occur, this is no longer true.
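
The dependence of the variance on n can be checked by direct calculation once a correlation structure is assumed. The sketch below assumes, purely for illustration, that the serial correlation at lag j is phi to the power j (a first-order autoregressive form), with phi positive standing in for slow trends and phi negative for a disturbance in which high and low values tend to alternate; N = 12 and the values of phi are arbitrary.

```python
def variance_of_difference(n, k, rho, sigma2=1.0):
    """Variance of (mean of high-level batches) - (mean of low-level batches).

    n   : batches run at one level before switching
    k   : repetitions of the high/low pair, so that N = 2*n*k
    rho : function giving the serial correlation at a given lag, rho(0) == 1
    """
    N = 2 * n * k
    # +1 for a batch at the high level, -1 for a batch at the low level
    signs = [1 if (i // n) % 2 == 0 else -1 for i in range(N)]
    total = 0.0
    for i in range(N):
        for j in range(N):
            total += signs[i] * signs[j] * rho(abs(i - j))
    # Each batch enters the difference of averages with weight +/- 2/N
    return sigma2 * (2.0 / N) ** 2 * total


def ar1(phi):
    """Assumed correlogram: correlation phi**lag between batches lag apart."""
    return lambda lag: phi ** lag


N = 12
for label, phi in [("uncorrelated", 0.0), ("positive", 0.6), ("negative", -0.6)]:
    row = {n: round(variance_of_difference(n, N // (2 * n), ar1(phi)), 3)
           for n in (1, 2, 3, 6)}
    print(label, row)
```

Under these assumptions the uncorrelated case gives the same variance for every n, the positively correlated case is best served by n = 1, and the negatively correlated case by the largest n considered.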

The nature of the correlation picture is best shown by a correlo- 
gram. This is simply a graph of the correlation coefficient be- 
tween observations one batch apart (lag 1 serial correlation), two 
batches apart (lag 2 serial correlation), and so on. Fig. 10(di)
shows how the correlogram would look in the situation in which
slow trends with time, illustrated in Fig. 10(ci), are occurring in
the batch yield. In this case maximum accuracy is obtained for
an arrangement in which changes are made as often as possible,
that is, after every batch, so that n = 1.

Fig. 10(cii) is typical of the so-called “carry-over” situation.
Here, due to material being held over in pipes, pumps, and so 
forth, variation is introduced of the type where a low value for one 
batch is followed by a high value in the next. Lag 1 correlation 
between successive observations is then negative. In this case 
greatest accuracy is obtained by making n large. In practice, 
this situation is often intermixed with a slow trend giving rise to 
a correlogram like 10(dii), in which case a compromise value of n 
may be found. 


Automatic Optimization for Continuous Processes 


So far we have considered Evolutionary Operation applied to a 
batch process. When the procedure is applied to a continuous 
chemical process one entirely new feature arises, namely, the time 
response of the plant itself to induced disturbances. Before con- 
sidering this new feature we note how generalizations of the ideas 
already discussed apply in the continuous case. 

Referring to Fig. 11, the introduction of deliberate variation in
the operating variable $\xi$ corresponds to the injection of an input
signal. In principle we can use any form of signal we wish and in
particular this might be a sine wave as illustrated in Fig. 11(a).
In Fig. 11(b) once more we have the (steady-state) process
function which shows the level to which the response will tend for
each value of the temperature $\xi$. Figs. 11(ci) and 11(cii) show types
of disturbance which might be observed in the response y at
fixed conditions of the process variables. The particular forms of
disturbance chosen for illustration are analogous to the ones
discussed for the discontinuous case. Their correlograms are shown
in Figs. 11(di) and 11(dii).

The implications of these different types of “noise” are most
readily appreciated by considering not the correlograms but their
cosine transforms, the power spectra, which measure the variance
or “power” of the disturbance associated with each frequency
and are shown in Figs. 11(ei) and 11(eii). The fluctuations in y
arising from the input signal $\xi$ (temperature) will be superimposed
on these random fluctuations. If it is assumed for the moment that
the response of the plant was instantaneous then the introduction
of a sinusoidal disturbance in $\xi$ would result in a periodic
variation in y and for the situation shown in Fig. 11 the chief
component of this induced variation in y would be a sine wave of
the same frequency and phase as that of the forcing function
corresponding to the vertical lines in Figs. 11(ei) and 11(eii).
This would be accompanied by harmonics of higher frequency
corresponding to the quadratic component, and so forth, in the
process function. If the level about which $\xi$ was varied were
situated on the other side of the maximum the induced fundamental
in y would be 180 deg out of phase with the forcing function. Other
things being equal we would prefer to operate at a frequency where
for example the signal-to-noise ratio is greatest. Thus, if this
were the major consideration a good choice would be the high
frequency in the case of the disturbance in Fig. 11(ci) and low
frequency in the case of Fig. 11(cii). This is precisely analogous
to the finding discussed before in the discontinuous case. For the
situation of low-frequency noise illustrated in Fig. 11(ci), a
high-frequency signal will be least confounded with noise and the
reverse will be the case in Fig. 11(cii).

Fig. 11 Some diagrams of importance for Evolutionary Operation
scheme on a continuous process

These indications provided by the correlograms and noise spectra
are of course inadequate to design the scheme in the continuous
case, for here it is necessary to consider also the time response
of the plant itself. In practice if we introduce a step
change in the level of the variable $\xi$ it will be some time before
the plant reaches a steady-state condition corresponding to the
new level. The nature of such an effect is best discussed in terms
of the well-known frequency-response diagrams like Figs. 11(f) and
11(g). These show the reduction in amplitude and the phase
change associated with various frequencies. Fig. 11(f) is a
diagram showing the fraction of the maximal amplitude in y which
is achieved at various frequencies of the forcing function about a
given value of $\xi$. Using frequency-response diagrams such as
this in conjunction with the diagram showing the noise spectrum
we can now arrive at the best operating frequency by choosing
that frequency giving maximum signal-to-noise ratio.
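
The choice of operating frequency just described amounts to maximizing the ratio of transmitted signal power to noise power over the available frequencies. The following sketch assumes, for illustration only, a plant that behaves as a first-order lag and a noise spectrum made up of a strong low-frequency component on a flat floor; the functional forms and every numerical constant are hypothetical.

```python
import math


def gain(f, corner=0.1):
    """Assumed amplitude ratio of the plant: first-order lag with given corner frequency."""
    return 1.0 / math.sqrt(1.0 + (f / corner) ** 2)


def noise_power(f, low_freq_level=10.0, knee=0.01, floor=0.1):
    """Assumed noise spectrum: low-frequency drift on top of a flat floor."""
    return low_freq_level / (1.0 + (f / knee) ** 2) + floor


def best_frequency(frequencies, amplitude=1.0):
    """Frequency on the grid giving the largest signal-to-noise ratio."""
    def snr(f):
        return (amplitude * gain(f)) ** 2 / noise_power(f)
    return max(frequencies, key=snr)


grid = [0.005 * i for i in range(1, 101)]   # cycles per unit time, assumed units
f_best = best_frequency(grid)
print(f_best)
```

With these assumed curves the best frequency is an intermediate one: high enough to escape the low-frequency noise, yet not so high that the plant attenuates the induced signal severely.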

In fact, the frequency-response diagrams themselves change 
with the levels of the process variables and a study must be made 
of their characteristics over the range of interest. 

Automatic optimization can be achieved, for example, by
arranging that the in-phase energy in the output signal,
corresponding to the observed response y at the fundamental
frequency, is stored and resets the controller at a higher or
lower average level when a cumulative signal at a suitable
distance above the noise level is reached. To do this a careful
study of the phase shift using diagrams like Fig. 11(g) may be
necessary. The arrangements must be such that, having allowed for
the shift, signals 180 deg out of phase will drive the controller
in a direction opposite to that taken when signals are in phase.
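
The principle can be shown in a much simplified sketch. Here the plant is assumed to respond instantaneously and without noise, the process function is a hypothetical quadratic with its maximum at 160, and the controller level is reset after every cycle in proportion to the in-phase component rather than only when a cumulative signal exceeds a threshold. None of these simplifications is part of the scheme described above.

```python
import math


def process(xi):
    """Assumed steady-state process function with its maximum at xi = 160."""
    return 60.0 - 0.05 * (xi - 160.0) ** 2


def in_phase_component(level, amplitude, samples=20):
    """Correlate the response over one cycle with the sinusoidal forcing signal."""
    total = 0.0
    for k in range(samples):
        theta = 2.0 * math.pi * k / samples
        y = process(level + amplitude * math.sin(theta))
        total += y * math.sin(theta)
    return 2.0 * total / samples


def optimize(level, amplitude=2.0, reset_gain=2.0, cycles=40):
    """Reset the average level once per cycle in the direction the signal indicates."""
    for _ in range(cycles):
        # In phase (positive): response rises with xi, so raise the level.
        # 180 deg out of phase (negative): lower the level.
        level += reset_gain * in_phase_component(level, amplitude)
    return level


print(optimize(150.0))
print(optimize(170.0))
```

Starting on either side of the maximum, the sign of the in-phase component drives the level toward the optimum. In a real plant the phase shift and the noise would both have to be allowed for before such a signal could be trusted.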

The operation of the servo thus set up depends not only on the 
elements already discussed but also on a number of other factors. 
Two such factors of particular importance are: 

1 The rate and manner in which the process function is itself 
changing (for example the rate of movement of the optimal tem- 
perature level as the catalyst ages) (see for example, [15]). 

2 The amplitude of the variation induced into the operating 
variable. In general, the facility with which the optimizing proce- 
dure follows shifts in the process function will be increased as the 
amplitude of the signal is increased. On the other hand, the 
greater the amplitude of this signal, the greater the losses which 
occur each time a swing away from optimal conditions occurs. 

Each process to be optimized presents a problem which must be 
studied individually and some calibration on the plant is in- 
evitably necessary. It is necessary in each case to obtain some 
idea of the type of change which is to be expected in the maximum 
and the rate at which this might occur. The optimal scheme 
should then be computed so as to balance losses due to excessive 
amplitude against losses arising from failure to follow changes 
with sufficient speed. In preliminary studies of this kind, simu- 
lation of the system on analog and digital computers is of great 
value. 

Research including work on small self-optimizing reactors has 
been conducted by the Statistical Techniques Research Group at 
Princeton in conjunction with the Chemical Engineering De- 
partment and is being continued and extended at the University 
of Wisconsin. Theoretical work is also going forward including 
the extension of these ideas to multivariable systems. 
Ejector-Nozzle Flow and Thrust 


An analysis for predicting the secondary and primary flows and the thrust coefficient 
of ejector nozzles is presented. Particular attention is given to the diverging shroud 
ejector in which the throat of the secondary stream is formed at a small distance 
downstream of the primary nozzle exit; i.e., near the plane of the minimum shroud area. 
The basic assumption in the analysis is that the shroud is sufficiently short so that the 
mixing of the two streams is incomplete, and that both streams have isentropic cores. 
The momentum thickness of the mixing region is obtained from the momentum-integral 
equation for the turbulent mixing region assuming that momentum and temperature 
diffuse at the same rate. The momentum thickness at the nozzle exit is related to the 
initial momentum thickness created by the wall separating the two streams. The 
exit-momentum thicknesses of the mixing region and the wall are used to obtain the actual 
thrust coefficient. Experimental data on primary-secondary flow properties and thrust 
coefficients of a divergent-shroud ejector nozzle show good correlation with the theory. 

Contributed by the Fluid Mechanics Subcommittee of the Hydraulic 
Division and presented at the Hydraulic Conference, Ann Arbor, Mich., 
April 13-15, 1959, of The American Society of Mechanical Engineers. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, December 
15, 1958. Paper No. 59-Hyd-5. 


Introduction 


Ejector nozzles have been used extensively 
in recent years in jet engines to handle quantities of secondary air 
taken aboard by the inlet. This air is used for cooling and may 
be passed through an ejector nozzle to match the inlet and 
engine. There have been a few papers describing the action 
of ejector nozzles [1].¹ Most of the papers to date deal with 
experimental results. No clear description of ejector-nozzle 
performance has been given. 

¹ Numbers in brackets designate References at end of paper. 

The object of this paper is to develop a method which 
accurately evaluates the performance of this type of nozzle with 
respect to flow and thrust. The type of nozzle considered in the 
paper is illustrated in Fig. 1. Primary and secondary streams 
meet at the exit of the primary nozzle, and the static pressures of 
both streams must be the same along the free streamline 
separating them. If the secondary static pressure at the exit of the 
primary nozzle is below the critical pressure of the primary, the 
primary will be choked at its exit. A Prandtl-Meyer expansion 
then occurs at this point such that the primary static pressure 
adjusts to the secondary static pressure. In this case the primary 
flow near and downstream of the primary-nozzle exit is far from 
one dimensional. If the secondary static pressure is above the 
critical pressure of the primary, the primary stream will continue 
to converge until it attains sonic velocity. The primary 
continues to accelerate beyond this point supersonically as the 
secondary and primary pressures continue to decrease. For this 
case the flow in the plane of the minimum shroud diameter may 
be essentially one dimensional. At the exit of the nozzle, the 


Nomenclature 


A, = total exit area of ejector noz- 


zle 


primary alone defined by 
Equation (4) 


tropic flow to ambient pres- 
sure 


» = exit area of primary nozzle Cm momentum coefficient of pri- J = function 
(convergent) mary stream defined by H = shape factor for a stream, 
A,, = area of isentropic primary Equations (16) and (18) H = 6*/0 
stream at nozzle exit ; Cn momentum coefficient of sec- k = ratio of specific heats 
A,,; = area of isentropic opansed ondary stream defined by L = ejector length beyond pri- 
stream at primary nozzle Equation (16) and (19) mary nozzle exit 
Cor, Cos angularity coefficients defined M* = nondimensional Mach num- 
by Equation (31) ber defined as local velocity 
A ’ ; D, total exit diameter of ejector divided by the velocity at 
pe = area of primary stream at A, 
~ nozzle sonic conditions, V/c 
A, = minimum shroud area D,’ exit diameter area of circular M,.* M* i i i 
A,* = sonic throat area of secondary af isentropic 
anions convergent-divergent nozzle stream at nozzle exit 
A,, = area of isentropic secondary ee as M,,* = M* of isentropic secondary 
at nesale exit stream at nozzle exit 
c* = velocity of sound at sonic D, - diameter of primary noz- P, = ambient pressure 
conditions of a stream : wad (convergent) : P, = critical pressure (static pres- 
C,, = thrust coefficient defined by D, sonic throat diameter of pri- sure at sonic conditions) 
Equation (16) of Appendix mary stream P, = nozzle-exit pressure 
I D,, diameter of primary stream at P,, = secondary static pressure at 
C,, = design point thrust coefficient A, primary nozzle exit 
with zero angularity loss F, actual gross nozzle thrust P, = primary total pressure 
C, = flow coefficient of converging ideal primary thrust for isen- 
Fig. 1 Divergent-shroud ejector nozzle 


secondary may be subsonic, sonic, or supersonic depending upon 
the shape of the shroud contour and the ambient pressure. 
Operating Characteristics of Divergent-Shroud Ejector Nozzle. 
Given an ejector nozzle with an A,/A, and an S/D,, Fig. 1, we 
may put varying amounts of secondary flow through the opening 
immediately downstream of the exit of the primary nozzle. If the 
ratio A,/A, is not too large, the primary stream will attach to the 
walls of the shroud with zero secondary flow. This will occur for 
ratios of primary total to ambient pressure greater than critical. 
As the secondary flow is increased at this same value of A,/A,, 
S/D,, and P,,/P,, the primary stream is first separated from the 
nozzle wall with a low secondary flow, and then the secondary 
stream begins to behave as a venturi approaching choked flow as 
more secondary flow is put through the nozzle. The secondary 
stream is accelerated as it flows to the minimum shroud area A, 
overexpanding the primary stream until the secondary is just 
choked at its throat. (It is assumed that the throat of the 
secondary occurs at the minimum shroud area A,; however, as 
data have shown, this throat may occur downstream of A,. The 
nozzle characteristics being described will, however, be the 
same.) Finally, after sufficient secondary flow has been put 
through the nozzle, the exit will be filled and the secondary and 
primary streams completely expanded. With additional flow 
above this amount, the nozzle will become underexpanded and 
the exit state will be independent of ambient pressure. Between 
the operating points at which the secondary flow is just choked 
and the nozzle is fully expanded there are shock waves somewhere 
between the throat of the secondary and the exit of the nozzle. 


From an inspection of the data available on divergent-shroud 
ejector nozzles, it is found that the minimum thrust coefficient at 
a constant ratio of secondary to primary flow occurs at the point 
at which the secondary just acts as a choked venturi; i.e., when 
the secondary flow is just choked. As the secondary flow becomes 
unchoked by decreasing P,,, the thrust coefficient again increases 
until it reaches approximately that of the primary nozzle alone. 
This latter thrust coefficient corresponds to a small secondary flow 
sufficient to separate the primary stream from the shroud walls. 
After the secondary becomes choked and P,, is increased, the 
thrust coefficient also increases as the shock waves in the streams 
move from the secondary throat to the exit of the nozzle and dis- 
appear. At this point the nozzle is completely expanded and the 
thrust coefficients are again high, reaching values of 97 to 98 per 
cent if angularity losses are negligible. 

In this paper the following basic assumptions are made in the 
theoretical treatment of the primary and secondary flow and the 
thrust coefficient for diverging-shroud ejector nozzles. 


1 The primary and secondary streams are only partially 
mixed at the nozzle exit; i.e., the boundary layers of the wall and 
mixing region have not met at the nozzle exit. 

2 The throat of the secondary occurs in the plane of the 
minimum shroud area A,. 

3 A simple expansion of the primary stream from its throat to 
the throat of the secondary gives the location of the free stream- 
line if the secondary pressure is below the primary critical pres- 
sure at the primary nozzle exit. This assumption is made for 
small values of the spacing ratio S/D, compared with unity,* 
since in this case the main stream has little influence on the free 
streamline. 

4 The flow is one dimensional for both secondary and pri- 
mary streams in the plane of the minimum shroud diameter if 
the secondary static pressure is above the primary critical pres- 
sure at the primary nozzle exit (the primary throat formed 
downstream of this exit). 

5 The secondary flow is one dimensional. 

6 For the rapidly converging shroud the wall boundary-layer 


* For nozzles employed on propulsion devices S/D, is necessarily 
small, usually less than 0.3. 


secondary total pressure 

distance between primary 
nozzle exit and minimum 
shroud area 

local absolute temperature in 
mixing region 

local free-stream absolute pri- 
mary temperature 

primary total absolute tem- 
perature 

local free-stream absolute sec- 
ondary temperature 

secondary total absolute tem- 
perature 

local velocity in mixing region 

local free-stream primary ve- 
locity 

exit free-stream primary ve- 
locity 

local free-stream secondary 
velocity 

exit free-stream secondary 
velocity 
mass rate of flow 

distance measured along 
streamline 

distance measured perpen- 
dicular to streamline and 
also defined by Equation 
(5) and Fig. 2 

dimensionless distance for ex- 
pressing similarity of ve- 
locity profiles in mixing re- 
gion 

= (M,°/M,*) 
(T',/T ty) 


T/T. 


thickness of secondary stream 
mixing region 

thickness of primary stream 
mixing region 

displacement thickness of en- 
tire mixing region, defined 
by Equation (40) 

momentum thickness of entire 


mixing region defined by 
Equation (41) 

= angle between free streamline 
and axis of nozzle 

= empirically determined pa- 
rameter for mixing-region 
velocity profiles where z = 
oy/z 

= half angle of divergent shroud 


ambient 

actual 

exit plane of nozzle 

effective for isentropic flow 

ideal (isentropic to ambient pres- 
sure) 

isentropic 

primary stream; also primary 
nozzle exit plane 

secondary stream; also plane of 
minimum shroud area 

total 

wall 
displacement thickness is neglected at the minimum shroud area 
A, in calculating the secondary-throat area at this position. 


We shall be concerned with the case in which the secondary 
throat occurs in the plane of A, with relatively small values of 
S/D, as compared with unity. The experimental comparisons 
with data obtained from scale-model tests of ejector nozzles ex- 
tend over a range of values of S/D, from 0.120 to 0.229. 

Thrust coefficients will be analyzed in terms of the momentum 
thicknesses of the wall and mixing region boundary layers and 
isentropic core flow to the nozzle exit. 


Theoretical Analysis 


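Flow Characteristics of the Ejector Nozzle. The secondary flow rate 
may be expressed as 

$$W_s \sqrt{T_{ts}} = \text{const}\; P_{ts} A_s^* \tag{1}$$

A similar expression holds for the primary flow rate. Hence the 
ratio of corrected secondary to primary flows $\omega$ is given by 

$$\omega = \frac{W_s \sqrt{T_{ts}}}{W_p \sqrt{T_{tp}}} = \frac{P_{ts}}{P_{tp}}\,\frac{A_s^*}{A_p^*} \tag{2}$$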

For sufficiently low ambient pressures, the throat of the secondary 
may be formed at any point between the exit of the primary and 
the exit of the nozzle, depending upon the shape of the shroud 
contour. In the diverging-shroud nozzle, the shroud is shaped 
such that the secondary throat usually occurs near the minimum 
shroud diameter except for very small secondary flows in almost 
cylindrical shrouds, Fig. 2. If this is the case, Equation (2) may 
be written as 

$$\omega = \frac{P_{ts}}{P_{tp}} \left( \frac{A_s}{A_p^*} - \frac{A_{ps}}{A_p^*} \right) \tag{3}$$


The ratio $A_s/A_p^*$ is not directly available or measured. However, 
in terms of the flow coefficient, this area ratio may be 
written as 

$$\frac{A_s}{A_p^*} = \frac{A_s}{A_{p,\mathrm{eff}}}\,\frac{A_{p,\mathrm{eff}}}{A_p^*} = \frac{A_s}{A_p}\,\frac{1}{C_f}\,\frac{A_{p,\mathrm{eff}}}{A_p^*} \tag{4}$$

where $A_s/A_p$ is the actual physical area ratio and $A_{p,\mathrm{eff}}/A_p^*$ is 
evaluated at the primary nozzle exit plane static to total pressure 
ratio. When the primary stream is choked at its nozzle exit 
(plane of $A_p$) this latter area ratio is unity. In this case, the 
secondary static and total pressures at the exit of the primary 
nozzle are assumed to be equal. 

Consequently, we may state the following for k = 1.4: 

$$\frac{A_{p,\mathrm{eff}}}{A_p^*} = 1; \qquad \frac{P_{ts}}{P_{tp}} \le 0.528 \tag{5}$$


The flow coefficient $C_f$ must be known or evaluated. 
The minimum actual shroud area $A_s$ is used since the wall 
boundary layer of the secondary stream is assumed negligibly 
small at $A_s$. The primary area $A_{ps}$ in the plane of $A_s$ is calculated 
on the basis of isentropic flow from $A_{p,\mathrm{eff}} = C_f A_p$ since the 
boundary-layer displacement thickness of the mixing region 
changes little for small values of S, Fig. 2. 

Fig. 2 Free streamline in a divergent-shroud ejector nozzle 

If the secondary pressure at the exit of the primary nozzle is 
greater than the primary critical pressure, $A_{ps}/A_p^*$ is evaluated on 
the basis of one-dimensional flow for both streams in the plane of 
$A_s$. Since the secondary is choked at $A_s$, the static pressure of the 
secondary in this plane must be equal to $P_c$ ($0.528\,P_{ts}$ for k = 
1.4). Consequently, the area ratio of the primary stream 
$A_{ps}/A_p^*$ may be found from the ratio of its static to total pressure 
or $0.528\,P_{ts}/P_{tp}$ for k = 1.4. 
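
The one-dimensional evaluation described above amounts to two standard isentropic relations: a Mach number from the static-to-total pressure ratio, and an area ratio from the Mach number. The sketch below uses illustrative function names (they are not from the paper) and assumes a perfect gas with constant k.

```python
import math


def critical_pressure_ratio(k=1.4):
    """Static-to-total pressure ratio at sonic conditions (0.528 for k = 1.4)."""
    return (2.0 / (k + 1.0)) ** (k / (k - 1.0))


def mach_from_pressure_ratio(p_over_pt, k=1.4):
    """Mach number reached by isentropic expansion to p/pt."""
    return math.sqrt(2.0 / (k - 1.0) * (p_over_pt ** (-(k - 1.0) / k) - 1.0))


def area_ratio(mach, k=1.4):
    """Isentropic area ratio A/A* at a given Mach number."""
    term = (2.0 / (k + 1.0)) * (1.0 + 0.5 * (k - 1.0) * mach * mach)
    return term ** ((k + 1.0) / (2.0 * (k - 1.0))) / mach


def primary_area_ratio_at_secondary_throat(pts_over_ptp, k=1.4):
    """Primary stream area over its throat area in the plane of the choked
    secondary throat, where the static pressure is the secondary critical
    pressure (0.528 times the secondary total pressure for k = 1.4)."""
    p_over_ptp = critical_pressure_ratio(k) * pts_over_ptp
    return area_ratio(mach_from_pressure_ratio(p_over_ptp, k), k)


print(primary_area_ratio_at_secondary_throat(0.7))
```

When the two total pressures are equal the primary is just sonic in that plane and the ratio returns to unity.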

It is desired to develop a simple method for predicting the flow 
characteristics and the thrust coefficients of an ejector nozzle. 
For the nozzle shown in Fig. 2, we shall assume that the shroud 
is shaped such that the secondary throat is formed at A,. If the 
secondary static pressure is sufficiently low at the point where the 
streams meet, the primary stream will undergo an expansion 
through a decreasing pressure until the throat of the secondary is 
reached. For a rapidly converging shroud and for values of 
S/D,* not too large compared with unity, we shall assume that 
the area of the secondary stream is determined primarily by the 
shroud contour up to the throat of the secondary. The primary 
contour or free streamline to the throat of the secondary is as- 
sumed for the case of P,, = P,, < 0.528 P,, to be obtained from 
an initial Prandtl-Meyer expansion to the total pressure of 
the secondary followed by a linearly decreasing pressure of the 
secondary. This linear decrease in pressure requires the sec- 
ondary area to vary as the inverse of the usual isentropic flow per 
unit area curve plotted against the static to total pressure ratio. 
Such a variation of area is approximated by the rapidly converg- 
ing shroud shown in Fig. 1. Assuming that the boundary layer 
is of zero thickness at the beginning of the shroud contour, we 
shall neglect the displacement thickness of the shroud boundary 
layer at the throat of the secondary. Also, for small values of the 
distance S the mixing of the two streams will be negligible. Hence 
the diameter of the primary is expressed as 


$$D_{ps} = D_p^* + 2y \tag{6}$$

and 

$$\frac{A_{ps}}{A_p^*} = \left(\frac{D_{ps}}{D_p^*}\right)^2 = \left(1 + \frac{2y}{D_p^*}\right)^2 \tag{7}$$

where y is the distance from $D_p^*$ to the free streamline and is 
measured perpendicular to the nozzle axis. This distance at the 
secondary throat may be expressed as 

$$\frac{y_s}{D_p^*} = \frac{S}{D_p^*} \int_0^1 \tan\nu \; d\!\left(\frac{x}{S}\right) \tag{8}$$

and may be obtained by a numerical integration over x/S with 
the values of $\nu$ obtained from tables of turning angle for 
supersonic flow [2]. 
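
The numerical integration can be sketched as follows. The names are illustrative, and several assumptions are made explicit: the primary is choked at its exit, the streamline angle is taken equal to the Prandtl-Meyer turning angle from sonic conditions, the secondary static pressure falls linearly from its total pressure to its critical pressure over the spacing S, the trapezoidal rule stands in for the tabulated values, and the flow ratio uses Equations (3), (4), (5), and (7) in the reconstructed form $\omega = (P_{ts}/P_{tp})\,[A_s/(C_f A_p) - A_{ps}/A_p^*]$.

```python
import math


def critical_pressure_ratio(k=1.4):
    return (2.0 / (k + 1.0)) ** (k / (k - 1.0))


def mach_from_pressure_ratio(p_over_pt, k=1.4):
    return math.sqrt(2.0 / (k - 1.0) * (p_over_pt ** (-(k - 1.0) / k) - 1.0))


def prandtl_meyer(mach, k=1.4):
    """Turning angle (radians) from sonic conditions to the given Mach number."""
    g = math.sqrt((k + 1.0) / (k - 1.0))
    b = math.sqrt(max(mach * mach - 1.0, 0.0))  # guard against round-off at M = 1
    return g * math.atan(b / g) - math.atan(b)


def throat_offset(s_over_d, pts_over_ptp, k=1.4, steps=200):
    """y_s / D_p* from Eq. (8) by the trapezoidal rule over x/S in [0, 1]."""
    crit = critical_pressure_ratio(k)
    if pts_over_ptp > crit:
        raise ValueError("primary is not choked at its exit for this pressure ratio")

    def tan_nu(xi):
        # Assumed linear fall of secondary static pressure: P_ts at xi = 0,
        # secondary critical pressure at xi = 1.
        p_over_pts = 1.0 - (1.0 - crit) * xi
        mach = mach_from_pressure_ratio(p_over_pts * pts_over_ptp, k)
        return math.tan(prandtl_meyer(mach, k))

    h = 1.0 / steps
    total = 0.5 * (tan_nu(0.0) + tan_nu(1.0))
    for i in range(1, steps):
        total += tan_nu(i * h)
    return s_over_d * total * h


def flow_ratio(as_over_ap, flow_coeff, s_over_d, pts_over_ptp, k=1.4):
    """Corrected secondary-to-primary flow ratio for a choked primary."""
    y = throat_offset(s_over_d, pts_over_ptp, k)
    aps_over_ap_star = (1.0 + 2.0 * y) ** 2        # Eq. (7)
    as_over_ap_star = as_over_ap / flow_coeff      # Eq. (4) with Eq. (5)
    return pts_over_ptp * (as_over_ap_star - aps_over_ap_star)  # Eq. (3)


# Hypothetical geometry and pressure ratio, for illustration only.
print(flow_ratio(as_over_ap=1.3, flow_coeff=0.98, s_over_d=0.2, pts_over_ptp=0.3))
```

Increasing the spacing ratio lets the primary stream spread farther before the secondary throat, which reduces the area left for the secondary and hence the flow ratio.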

If the secondary total pressure or static pressure at the exit of 
the primary nozzle is greater than the critical pressure of the pri- 
mary, the primary stream will continue to contract until the 
secondary pressure reaches the critical pressure of the primary. 
The primary will then expand through decreasing secondary 
pressures until the throat of the secondary is reached. Equations 
(7) and (8) have been combined with Equations (3) and (5) to 
express directly the relationship among w, P;,/P,,, A,/A,,, and 
S/D,* for the case in which the throat of the secondary occurs 
in the plane of the minimum shroud cross section. The results 
are shown in Fig. 3. A straight line through any three of the 
variables on this chart determines the fourth. 

As the ratio of secondary total to primary total pressure is 
increased, the pressure in the exit plane of the primary nozzle 
eventually becomes large enough to unchoke the primary stream 
in this plane. The assumption made was that the initial 
secondary Mach number is approximately zero or that the pressure 
in this plane is the secondary total pressure. Therefore the 
secondary total to primary total pressure ratio at which the primary 
nozzle unchokes (and the throat of the primary stream moves 
downstream) is the critical pressure ratio for the primary (for 
k = 1.4, $P_{ts}/P_{tp}$ = 0.528). 

Fig. 3 Secondary-primary flow ratio; k = 1.4 for both streams 

As P,,/P,, increases, the ratio of secondary to primary flow 
and the initial secondary Mach number increases. The primary 
flow becomes more nearly one dimensional, since it does not turn 
a corner at the exit of the primary nozzle after P,,/P,, be- 
comes greater than the critical-pressure ratio. Therefore for 
P,,/P,, greater than critical we shall assume that the flow is one 
dimensional. Hence the spacing S is no longer a variable in the 
problem. In place of S we shall now substitute a variable which 
will account for finite initial secondary Mach numbers. The pres- 
sure in the plane of the primary nozzle exit is now the secondary 
static rather than total pressure. However, a more useful variable 
is the initial effective flow area of the secondary stream at the exit 
of the primary nozzle. 

Equation (3) remains valid, but the size of the primary stream 
in the plane of the minimum shroud area is evaluated on the basis 
of one-dimensional flow. Since the throat of the secondary is 
assumed to occur in the plane of the minimum shroud area, the 
static pressure there is the secondary critical pressure. The ratio 
of the primary area there to its throat area $A_{ps}/A_p^*$ is thus a 
function of the ratio of this static pressure to the primary total 
pressure for isentropic flow, or for k = 1.4, 
$A_{ps}/A_p^* = f(0.528\,P_{ts}/P_{tp})$. 

The area ratio A,/A,* in Equation (3), however, is now not 
simply A,/A,, because the primary throat is downstream of the 
primary nozzle exit. This area ratio may be expressed as follows: 


A,/A,* = (9) 


where A,/A,, = A,/(C,A,). The area ratio A,,/A,* is again a 
function of the static pressure at A,, to the primary total pressure 
for isentropic flow. The static pressure at A,, is the initial sec- 
ondary static pressure P,,. We may write, therefore, 
Pin. 
A,,/A,* Si(Pop/P ry) fi ( (10) 


P,, P., 

Since the initial effective secondary area A,, in the plane of the 
primary nozzle exit has been selected to be an independent varia- 
ble rather than the pressure P,, we may eliminate this pressure 
for isentropic flow since 


= f2(A,./A,*) = 4° 
The quantities A,,/A,* and A,*/A,* may be eliminated among 
Equations (2), (10), and (11). The resulting equation may be 


written as 
P 
Pei Pe 


This equation may be combined with Equation (3) which is re- 


written as 


Combining Equations (12) and (13) to eliminate P,,/P,,, we note 
that the relation between w and A,/A,, is linear. A simple 
nomograph may be constructed since there are only four 
variables and a linear relation between two of them. The results 
are plotted in Fig. 3 as an extension to the case for which P,,/P,, 
is less than the primary critical-pressure ratio. It appears from an 
inspection of Fig. 3 that the transition between the two parts of 
the theory is not completely smooth at the critical-pressure ratio 
of 0.528. However, the flows obtained from the combined 
theories form a fairly smooth curve if w is taken to approach as- 
ymptotically that predicted by the one-dimensional solution for 
P,,/P, greater than about 0.8. 

A 
A 


P,, (12) 

Thrust Coefficient of the Ejector Nozzle. The thrust coefficient is 
defined as the total axial momentum of both primary and 
secondary streams plus the product of the exit area and the difference 
between the exit and ambient pressures all divided by the 
isentropic thrust obtained from both streams if expanded to ambient 
pressure. Hence the thrust coefficient is expressed as 

$$C_F = \frac{(WV)_{pri} + (WV)_{sec} + (P_e - P_a)A_e}{(WV_{pri})_{id} + (WV_{sec})_{id}} \tag{14}$$


The development of the thrust coefficient is given in the Appendix. 
The results may be expressed as 

$$C_F = \frac{C_{\theta p} C_{mp} M_{pie}^* + C_{\theta s} C_{ms}\,\omega M_{sie}^* + 1.355 \left(\dfrac{P_e}{P_{tp}} - \dfrac{P_a}{P_{tp}}\right) \dfrac{A_e}{A_p^*}}{M_{pid}^* + \omega M_{sid}^*} \tag{15}$$

where $M_{pie}^*$ and $M_{sie}^*$ are obtained from a one-dimensional 
isentropic flow analysis to the exit of the nozzle as shown in Appendix 
I. The results of this analysis are given in Fig. 9 from which 
$P_e/P_{tp}$ is obtained from $P_{ts}/P_{tp}$, $\omega$, and $A_e/A_p^*$ by drawing a 
straight line through the three known quantities. $C_{mp}$ is given in 
Fig. 7 as a function of the thrust coefficient for the simple 
converging nozzle alone at design point; i.e., at the critical pressure 
ratio of 0.528 if the nozzle is choked or at the nozzle 
pressure ratio if it is unchoked. In this manner all 
distortions in the velocity profile are assumed to be expressible in 
terms of a single momentum thickness at the wall. If the design 
point thrust coefficient for the converging nozzle is not known, 
$C_{mp}$ may be calculated from boundary-layer mechanics [3], 
Equations (47) and (49) of Appendix II. $C_{ms}$, the secondary 
momentum coefficient, is plotted in Fig. 8 as a function of the 
design-point thrust coefficient of a simple converging-diverging nozzle 
with the same wall length as the shroud of the ejector nozzle and 
the same exit Mach number and area as the secondary stream 
of the ejector nozzle. The angularity coefficients $C_{\theta p}$ and $C_{\theta s}$ for 
the two streams are obtained from Equation (31) of Appendix I. 
The quantities $M_{pid}^*$ and $M_{sid}^*$ are obtained from the total 
primary to ambient and total secondary to ambient pressure 
ratios, respectively. These quantities differ, of course, from 
$M_{pie}^*$ and $M_{sie}^*$ which are found from isentropic flow to the 
actual exit pressure. 


Results and Discussion 


Nozzle Flow. The type of ejector nozzle shown in Fig. 1 is 
characterized by small ratios of $S/D_p$ and a diverging shroud. 
The diverging shroud permits supersonic expansion of both the 
secondary and primary streams. The theoretical corrected 
secondary to primary flow ratios may be obtained from Fig. 3 and 
are shown in Fig. 4 with data. The values of $\omega$ are independent 
of $P_{tp}/P_a$ since both streams are choked. For each ejector 
nozzle, therefore, $\omega$ is only a function of $P_{ts}/P_{tp}$. 

Fig. 3 includes zero values of the spacing ratio, $S/D_p^*$. The 
zero value is valid only with a flow coefficient of unity and 
negligible primary nozzle wall thickness. Combination of 
Equations (3), (7), and (8) tacitly assumes that the entire area between 
the primary streamline and the shroud is available for secondary 
flow. Such is obviously not the case for a zero value of S and 
a flow coefficient less than unity. Consequently, in the case of 
actual nozzles which have a flow coefficient less than unity and a 
finite wall thickness, the use of Fig. 3 is not recommended for 
values of $S/D_p^*$ less than about the effective displacement 
thickness of the primary nozzle. 

Thrust Coefficient. Equation (15) for the thrust coefficient is 
compared with experiment in Figs. 10 and 11 for the nozzles of 
Fig. 4. It is noted that in these figures, the actual thrust coefficients 
generally fall within about 1/2 per cent of the theoretical. There 
is a 1 per cent discrepancy between the theoretical and actual 
thrust coefficients in Fig. 11 for the value of $P_{ts}/P_{tp}$ = 0.395. 
From an inspection of the data, this error appeared to be in the 
data rather than in the theoretical thrust coefficient. 

Although a 1/2 per cent error in predicted thrust coefficient is 
good, potential sources of error are: 

1 The initial momentum thickness of the secondary stream 
was assumed zero ahead of the mixing region (see Appendix II). 

2 The exit angle of the primary stream and its angularity loss 
cannot be determined exactly without knowing the contour of the 
primary stream. 

3 A one-dimensional analysis does not give the deviations of 
the exit velocity from that of a uniform one. 

* The data are from General Electric Company reports. 

However, in general, the correspondence between the actual 
and the theoretical thrust coefficient is good. 
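
The thrust-coefficient evaluation can be sketched numerically. The form used below is a reading of Equation (15), whose printed version is partly illegible: numerator $C_{\theta p} C_{mp} M_{pie}^* + C_{\theta s} C_{ms}\,\omega M_{sie}^*$ plus a pressure-area term, denominator $M_{pid}^* + \omega M_{sid}^*$. The function and argument names are illustrative, both streams are assumed to leave at the same static pressure, and the pressure-area factor is computed as $1/[k\,(2/(k+1))^{k/(k-1)}]$, which gives about 1.352 for k = 1.4 where the source prints 1.355.

```python
import math


def mstar_from_pressure_ratio(p_over_pt, k=1.4):
    """M* = V/c* after isentropic expansion to static/total ratio p/pt."""
    return math.sqrt((k + 1.0) / (k - 1.0) * (1.0 - p_over_pt ** ((k - 1.0) / k)))


def thrust_coefficient(c_theta_p, c_mp, c_theta_s, c_ms, omega,
                       pe_over_ptp, pa_over_ptp, pts_over_ptp,
                       ae_over_ap_star, k=1.4):
    """Thrust coefficient in the assumed form of Eq. (15)."""
    # Ratio of P_tp * A_p* to the sonic momentum flux of the primary stream.
    pressure_factor = 1.0 / (k * (2.0 / (k + 1.0)) ** (k / (k - 1.0)))
    # Expansion to the actual exit pressure.
    m_pie = mstar_from_pressure_ratio(pe_over_ptp, k)
    m_sie = mstar_from_pressure_ratio(pe_over_ptp / pts_over_ptp, k)
    # Ideal expansion to ambient pressure.
    m_pid = mstar_from_pressure_ratio(pa_over_ptp, k)
    m_sid = mstar_from_pressure_ratio(pa_over_ptp / pts_over_ptp, k)
    numerator = (c_theta_p * c_mp * m_pie
                 + c_theta_s * c_ms * omega * m_sie
                 + pressure_factor * (pe_over_ptp - pa_over_ptp) * ae_over_ap_star)
    return numerator / (m_pid + omega * m_sid)


# Hypothetical operating point: fully expanded, with 1 per cent momentum loss
# in each stream and 1 per cent angularity loss.
print(thrust_coefficient(0.99, 0.99, 0.99, 0.99, omega=0.1,
                         pe_over_ptp=0.1, pa_over_ptp=0.1,
                         pts_over_ptp=0.4, ae_over_ap_star=3.0))
```

With the exit pressure equal to ambient and all coefficients at unity, this form returns exactly one, which is a useful sanity check on any reconstruction of the equation.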

The shock interaction between the secondary and primary 
streams for overexpanded nozzles is not treated analytically in 
this report. However, the calculation of the thrust coefficient 
may be extended somewhat below design point values of P,,/P, 
with Equation (9) because the flow remains essentially isentropic 
to the nozzle exit. The nozzle exit pressure falls below ambient 
and all compression is assumed to occur across the oblique shock 
at the nozzle exit. However, as P,,/P, decreases farther, the 
oblique shock steepens, the flow separates from the nozzle wall, 
and the shock moves upstream. When this occurs, Equation (9) 
is no longer valid. 

Small Secondary Flows. The principal assumption in the analysis 
presented is that the secondary stream has an isentropic core to 
the nozzle-exit plane. However, as the secondary flow is de- 
creased, the shroud wall and mixing-region boundary layers 
close. For ejector nozzles used as propulsion devices, the axial 
length of the shroud is about the same as the primary nozzle 
diameter; i.e., $L/D_p \approx 1$. If the primary stream discharged into 
quiescent air it would entrain about 6 per cent secondary flow in 
a distance of one diameter downstream. However, since the 
secondary flow is rapidly accelerated, the ratio of the local free- 
stream secondary to primary velocity increases, and the shear 
stress between streams, which accounts for the entrained flow, 
decreases [3]. Consequently, we would expect the entrained 
secondary flow to be somewhere between 0 and 6 per cent of the 
primary stream. 

From an inspection of data⁴ it is seen that the prediction of $\omega$ 
is low by about 0.02. On an absolute flow basis, the prediction is 
good. An inspection of shroud-pressure measurements on conical 
divergent shrouds⁴ indicates that for the very low secondary 
flows, w less than about 0.03, the throat of the secondary moves 
downstream of the plane of minimum shroud area. However, the 
static pressures decrease slowly between the minimum shroud 
area and the secondary throat. This phenomenon caused by 
negative friction delays choking of the secondary stream, an 
effect opposite to that of positive friction in one-dimensional 
flow [2]. However, as suggested by the small change in sec- 
ondary static pressures between the plane of the minimum shroud 
area and the secondary throat, the calculation of secondary flow 
on an isentropic basis to the plane of the minimum shroud area is 
a good equivalent approximation. 

An important consequence of neglecting the entrained flow and 
deviations in one dimensionality of the primary flow is that the 
secondary flow is usually larger than predicted for small w. This 
phenomenon is also evidenced by the fact that for fully choked 
nozzle operation some secondary to primary flow ratios yield sub- 
sonic secondary exit states on the basis of a one-dimensional flow 
calculation (above the secondary sonic line in Fig. 9). Such 
states are impossible. In addition, on the basis of one-dimen- 
sional flow at the exit plane with the secondary throat assumed 
located there, the differences between measured and theoretical 
flow are much larger than those indicated by the theory used in 
Fig. 4. Therefore the exit state calculated on the basis of one-di- 
mensional flow is not correct in some cases. For nozzle geome- 
tries encountered these errors are usually for low secondary 
flow and small exit to primary nozzle area ratios. 


APPENDIX I 
Thrust Coefficient of the Ejector Nozzle 


The thrust coefficient is defined as the total axial momentum 
of both primary and secondary streams plus the product of the 
exit area and the difference between the exit and ambient pres- 
sures all divided by the isentropic thrust obtained from both 
streams when expanded to ambient pressure. Hence, the thrust 
coefficient is given as 


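$$C_F = \frac{C_{\theta p} C_{mp} (WV)_{pie} + C_{\theta s} C_{ms} (WV)_{sie} + (P_e - P_a) A_e}{(WV)_{pid} + (WV)_{sid}} \tag{16}$$


where $C_{\theta p}$, $C_{\theta s}$ are angularity coefficients and where $C_{mp}$ and $C_{ms}$ 
are momentum coefficients of the primary and secondary streams, 
respectively, and are expressed in terms of momentum thickness 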


⁴ Unpublished General Electric Company data on ejector nozzles 
with divergent half angles from 0.7 to 15 deg. 


Fig. 6 Ratio of final to initial momentum thickness for mixing of two 
accelerating streams 


of the mixing region and the wall boundary layer. Using the 
mixing-region momentum thickness of the combined secondary 
and primary streams at the exit of the nozzle which is given in 
Appendix II, the actual primary momentum may be expressed 
as⁵ 


= WV = — (OpV pe) (17) 

oP 
since $\theta_e$, the momentum thickness of the mixing region at the exit, 
represents the thickness of free-stream momentum which is lost 
as compared to the isentropic momentum. With the continuity 
equation, the actual primary momentum is given by 


pe 
0, 40 1 ] 
Hence C,,,, becomes 
4 1 


where 6,/6 is the ratio of the momentum thickness at the exit of 
the nozzle to that at the beginning of the mixing region. This 
quantity is determined in Appendix II and is plotted in Fig. 6. 
However, C,,, may be obtained directly from Fig. 7 if the thrust 
coefficient at design point is known for the converging primary 
alone. Fig. 7 was obtained from the fact that the design-point 


5 Combination of the momentum of the entrained secondary with 
the primary stream results in considerable simplification. Hence @ 
represents the momentum thickness of the entire mixing region based 
on the properties of the primary stream.

Fig. 7 Primary momentum coefficients


thrust coefficient for a simple converging nozzle is given by 
(50) 


where the design point for a simple converging nozzle is at the 
critical-pressure ratio of 0.528 for normal air. 
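The critical-pressure ratio of 0.528 quoted for normal air can be checked against the standard isentropic relation for a perfect gas, (2/(γ + 1))^(γ/(γ − 1)). The following is a minimal sketch, assuming a specific-heat ratio γ = 1.4; the function name is illustrative and does not come from the paper.

```python
def critical_pressure_ratio(gamma: float = 1.4) -> float:
    """Static-to-total pressure ratio at a sonic throat for a perfect gas.

    gamma is the ratio of specific heats; 1.4 is an assumed value for air.
    """
    return (2.0 / (gamma + 1.0)) ** (gamma / (gamma - 1.0))


if __name__ == "__main__":
    # Compare with the 0.528 figure quoted in the text.
    print(f"critical pressure ratio = {critical_pressure_ratio():.3f}")
```

With a different γ (for example a hot exhaust gas) the design-point pressure ratio of the converging nozzle shifts accordingly.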

Since the effect of the mixing region on the secondary stream 
has been included already in the primary momentum coefficient, 
the only loss in momentum of the secondary stream is due to wall 
friction. The momentum thickness due to the wall-boundary 
layer at the exit of the shroud may be obtained from [3], page 
478. However, a more direct method of obtaining the secondary 
momentum coefficient C,,, is given by the following: 


_ 


A (19) 


se 
After removing the angularity correction, the thrust coefficient at 


design point for a simple converging-diverging nozzle may be 
written as 


(20) 
ac 

For the converging-diverging nozzle with a contour length 

equivalent to that of the ejector shroud and for the same leaving 

Mach number as the secondary stream of the ejector, the mo- 

mentum thicknesses will be equal and the momentum thickness 

over the exit area may be written as 


7,/A,, = (1 — C,,)/D,’ (21) 


Combining Equations (19) and (21), we obtain the momentum 
coefficient of the secondary stream in terms of the thrust co- 
efficient of a simple converging-diverging nozzle at its design 
point. The resulting expression is 

Cue = 1 — (1 — C,,)D,/D,’ (22) 


Since the exit areas must be the same, the following relationship 
also holds:

Fig. 8 Secondary momentum coefficient


4A,,/e = D,'? = — D,,2/D,2) 


(24) 


(24a) 
A, 


and 


The momentum coefficient of the secondary stream for use in 
Equation (16) is plotted in Fig. 8 as a function of the ratio of the 
primary exit area to nozzle exit area. The design-point thrust co- 
efficient of a CD nozzle without angularity loss is often taken 
from about 0.975 to 0.985. 

Dividing both numerator and denominator of Equation (16) by 
the ideal primary momentum, we obtain 


V 


ee ms > 
peid W 2 


W, ie 


(25) 


Myesa” \T 


(26) 


Vos is 
V peta 


M V este M 


V 


peid 


The quantity P, — P, is zero when the secondary is subsonic, 
when there is a shock in the nozzle, and when the nozzle is operat- 
ing at design pressure ratio. However, for all other conditions 
this quantity may be expressed as 


(P, ( 
WV 


Using the expression 


Equation (27) may be reduced to the following form:
WV M Py Py A,*
where A,* is equal to C,A, when the secondary static pressure is 
less than 0.528 P,, at the exit of the primary nozzle (i.e., the pri- 
mary throat is formed at the primary-nozzle exit). When the 
throat of the primary is not formed at its nozzle exit, the follow- 
ing identity may be written: 

A, 


where A,, = C/A, 
and A,,/A,* is obtained from the ratio of the static secondary 
pressure at the primary exit to the primary total pressure or 
P,,/P 1. This regime of operation is treated in the text. 


For normal air for both the primary and secondary streams, 
Equation (25) for the thrust coefficient becomes 


P 
+ Cot + 1.355 ( 


M + OM 


The angularity coefficients C,, and C,, will be assumed to be 
given by the normal quantities for conical flow. For convenience 
and since the exact shape of the free streamline is not known, we 
shall take the following for the angularity coefficients: 


Cop = 1/2(1 + cos Cy, = cos (31) 


where φ is half of the included angle of the secondary shroud.
The secondary angularity coefficient is taken as cos φ only, since
all of the secondary stream flows near the shroud boundary.
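The angularity coefficients of Equation (31) are simple functions of the shroud half angle φ: 1/2(1 + cos φ) for the primary stream and cos φ for the secondary stream. A small sketch follows; the function and variable names are hypothetical, and the 15-deg half angle in the usage lines is only an illustrative value.

```python
import math


def angularity_coefficients(half_angle_deg: float) -> tuple[float, float]:
    """Return (primary, secondary) angularity coefficients per Equation (31).

    half_angle_deg is half of the included angle of the secondary shroud.
    """
    phi = math.radians(half_angle_deg)
    primary = 0.5 * (1.0 + math.cos(phi))  # conical-flow value for the primary
    secondary = math.cos(phi)              # secondary flows near the shroud wall
    return primary, secondary


if __name__ == "__main__":
    c_primary, c_secondary = angularity_coefficients(15.0)
    print(f"primary = {c_primary:.4f}, secondary = {c_secondary:.4f}")
```

For any nonzero half angle the secondary coefficient is the smaller of the two, which reflects the assumption that the whole secondary stream leaves at the wall angle.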

To use Equation (30) the quantities M,,,,*, M,,,,*, and P,/P,, 
remain to be evaluated. When the ejector nozzle is underex- 
panded, we must determine the primary and secondary Mach 
numbers in the nozzle exit plane 


We may write the secondary flow area at the nozzle exit as* 
A, = A, Ay (32) 
Equation (2) may be written as 
P,A,,(A,*/A,,) 
Equations (32) and (33) may be combined to yield 
x (Al Ay? Ap/A,”) 
Pe 


(33) 


Also from the isentropic flow relations 


A,,/A,* = I(P,/P 5) 


Hence Equation (34) becomes 
[A,/A,° I(P./F 


w = P,,/P,— ( P, (37) 
f Pw ) 


6 The actual area for effective one-dimensional flow of the secondary
must be reduced by the displacement thickness of the wall boundary 
layer. However, the entrained secondary flow in the mixing region 
yields a displacement thickness for the secondary stream which is 
negative or which tends to increase the effective isentropic area for the 
secondary stream. This displacement thickness moves the effective 
free streamline toward the nozzle axis. The displacement thickness of 
the primary stream will move the free streamline to a larger diameter 
than the isentropic primary since there is a decrease of velocity due to 
the deceleration of the primary by the secondary stream. The net 
effect on the free streamline is a movement toward the axis since the
displacement thickness of the primary is smaller than that of the 
secondary, or an increase in area for the secondary. The net effect of 
this free-streamline displacement, coupled with the wall boundary- 
layer displacement thickness for smooth walls, will tend to com- 
pensate leaving the effective isentropic secondary area at the exit of 
the nozzle approximately the same as that obtained when neglecting 
both the wall boundary-layer displacement thickness and the net 
displacement thickness of the mixing region. 


Secondary Sonic 


Fig. 9 Nozzle-exit state for two isentropic streams unmixed in one-dimensional flow

Fig. 10 Ejector-nozzle thrust coefficient


The last equation has four variables and is linear in w and A,/A,*.
Therefore Equation (37) may be plotted simply in nomograph 
form. The result is given in Fig. 9. A straight line through any 
given three of the four variables in this figure will determine the 
fourth. Knowing w, P,,/P,,, and A,/A,* we may determine the 
actual exit state of the ejector nozzle, P,/P,,; P,/P,, is obtained 
from (P./P ug) 


APPENDIX II


Mixing of Primary and Secondary Streams in an Ejector 
Nozzle 


The mixing region in an ejector nozzle follows the free stream- 
line shown in Fig. 2. An initial boundary layer exists on the walls 
of the secondary and primary streams before the streams meet. 
We wish to determine the loss in momentum of the combined 
streams at the exit of the nozzle as compared with isentropic flow 
of the two streams. From the momentum integral equations of 
the mixing region for axially symmetric flow, using the condition 
that there is no shear stress at the outer boundaries of the mixing 
region, we obtain the following expression: 


_ a dP 
-U, i r pudy = —r,(6, + 4,) 


The momentum thickness 6 of the combined streams based on 
the primary free-stream momentum is defined as 


(38) 


bp : 0 
= f, pu(U, — u)dy + —u)dy (39) 


Fig. 11 Ejector-nozzle thrust coefficient


The momentum thickness is the width of a stream at the free-
stream isentropic velocity and density which represents the loss of
momentum due to a velocity distribution different from the isen- 
tropic velocity near the boundaries of the flow. If we determine 
the momentum thickness of the mixing region at the exit of the 
ejector nozzle, we may calculate the loss of momentum of 
the two streams based on the isentropic momentum leaving the 
ejector nozzle. The quantity 6* represents the displacement 
thickness of the mixing region for the secondary and primary 
streams based on the primary free-stream flow and is defined as 


dp 0 pu 
p,U ,6* f (p,U, — pu)dy + {1 dy 
0 — 8, se 


(40) 


In addition, for the free streams just outside the mixing region 


(41) 


With Equations (39), (40), and (41), the momentum equation of 
the mixing region may be reduced to 


d d A, \'” d p 
— (log 0) + — | log | — + — [log 
dz dx [1 ) ] dz ( Pep ) 


d 
+ (2 + 6*/8) , (log M,*) = 0 (42) 


This equation is integrable if we know the variation of the ratio of 
displacement thickness to momentum thickness or shape factor as 
a function of distance along the mixing boundary. 

If the shape factor is either constant or small compared with 2, 
Equation (42) is immediately integrable. It is assumed that the 
velocity distribution of the initial boundary layer which is formed
by the wall separating the secondary and primary streams ahead
of the mixing region rapidly transforms into that given for the 
mixing of two streams (page 491 of Reference [3]). Equations 
(39) and (40) are used to determine the shape factor. Using the 
velocity distribution given in [3] and assuming that the tem- 
perature distribution is of the same form, we may write 


v, -V, 

u = '/(V, + V,) (: (43) 

and T = '/,(T, + (1 + 2) (44) 
+f, 


Since the static pressures outside of the mixing region are the 
same in the primary and secondary streams, we may write 


p/p, = T,/T; p/p, = T,/T (45) 


Using Equations (39) and (40), we obtain the shape factor in the 
following form: 


B-a 


The quantity 40,/D, may be obtained in terms of the design- 
point thrust coefficient of the primary converging nozzle. Since 
the thrust coefficient of a converging nozzle at design point may 
be written as 


40, 


= 


(50) 


we obtain the momentum coefficient for the primary stream of the 
(1 — C,,)(0,/60) 


assuming A, ~ A,* for this second-order correction. 

The results of Equation (51) have been plotted in Fig. 7. In 
this figure it is seen that the primary momentum coefficient is 
always less than unity as it must be and decreases with decreasing 
ratios of 7',,/T,,. It is interesting to note that, theoretically, the 
momentum coefficient due to mixing increases at a total tempera- 
ture ratio of unity from its initial value and decreases from its 


ejector nozzleas C,, = 1 — (51) 


6* (B — a)(l+ ar k= (1 — erf*z) — (1 ert dz 


—=H = 0.125 —____ 
6 


where 
a = V,/V, = (M,*/M,*)(T,, T 2 
and 
B= 7,/T, = (T./T 


The shape factor has been computed from Equation (45) for
several different values of the ratio $T_{ts}/T_{tp}$ and for a range of
values of $M_p^*$ from 1.2 to 1.8 and $M_s^*$ from 1 to 1.4. The following
results were obtained:


T_ts/T_tp    (δ*/θ)max    (δ*/θ)min    (δ*/θ)ave = H
1            -0.600       -0.355       -0.48
1/3          -1.347       -1.293       -1.32
1/6          -1.661       -1.558       -1.61


It is noted that as H becomes larger with respect to 2, it also
becomes more constant. Hence $H_{ave}$ will be used in Equation
(42).
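The tabulated average shape factors appear to agree with the simple arithmetic mean of the listed maximum and minimum values. The sketch below only checks that arithmetic against the table; it is an observation about the numbers and not a statement of how the averages were actually obtained. The dictionary keys and function name are hypothetical.

```python
# (maximum, minimum, tabulated average) of the shape factor for each
# total-temperature ratio listed in the table above.
SHAPE_FACTOR_TABLE = {
    "1": (-0.600, -0.355, -0.48),
    "1/3": (-1.347, -1.293, -1.32),
    "1/6": (-1.661, -1.558, -1.61),
}


def mean_of_extremes(h_max: float, h_min: float) -> float:
    """Arithmetic mean of the extreme shape-factor values."""
    return 0.5 * (h_max + h_min)


if __name__ == "__main__":
    for ratio, (h_max, h_min, h_tab) in SHAPE_FACTOR_TABLE.items():
        h_mean = mean_of_extremes(h_max, h_min)
        # 2 + H is the factor multiplying d(log M*)/dx in Equation (42).
        print(f"ratio {ratio}: mean = {h_mean:.4f}, "
              f"tabulated = {h_tab:.2f}, 2 + H = {2.0 + h_tab:.2f}")
```

The spread between maximum and minimum shrinks from about 0.25 at a temperature ratio of 1 to about 0.1 at 1/6, which is the sense in which H "becomes more constant".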

At the point where the streams meet, it is assumed that the 
boundary layer of the secondary stream is negligible if the 
secondary stream Mach number there is very small. The shape 
factor for an accelerating stream near a wall is approximately 1.1. 
Therefore the initial shape factor must change from a value of 
about 1.1 to a negative value. However, since it has been as- 
sumed that the velocity distribution in the mixing region rapidly 
becomes that given by the mixing of two streams, the initial value 
of 1.1 has been neglected. 

Using the values listed for different ratios of 7,,/T,,, we may 
integrate Equation (42). The result is 


= 0.6339(p5,/Pip) "(A 2 *™ (47) 


where H takes on the values listed before for different ratios of 
T,,/T,. The momentum coefficient C,,, (see Appendix I) may 
be expressed as 


Cap = 1 — 40,/D,, (48) 
or Cup = 1 — 0,/00 (49)
erf z(1 — erf z)dz


initial value at the total temperature ratios of 1/6 and 1/3. The 
quantity 0,/4 is also plotted against the isentropic primary M,,* 
in Fig.6. Therefore, knowing the initial value of the momentum 
thickness 4 it is possible to determine the momentum thickness 
at the exit of the mixing region.


DISCUSSION
J. Fabri and E. Le Grives⁷


As many authors have previously pointed out, a one-dimen- 
sional description of ejector performance can be made only on a 
simplified scheme of this apparatus, with the assumption that the 
entrainment and mixing process occur either under constant 
pressure or in a constant area mixing-tube [1].⁸

The analysis suggested by Weber applies to a more general case, 
in which the ejector geometry, that is a convergent-divergent 
nozzle, results from design operation imperatives and is suited at 
least to one of the following two flow patterns: 

1 In the first case, the primary flow is under-expanded, and the 
converging shroud of the ejector is so contoured as to make the 
induced flow pressure linearly decrease with the axial distance 
from the primary nozzle exit plane. Such an assumption is 
meant to allow a simple description of the free stream-line shape 
and implies a very steep convergence of the shroud inlet; there 
is no proof that this particular secondary area variation, which
can hardly be related to a one-dimensional analysis anyhow, will
give optimum performance.

7 Office National d'Etudes et de Recherches Aéronautiques, Paris,
France.

8 Numbers in brackets designate References at end of discussion.

2 In the second case, when the secondary total pressure ex- 
ceeds the primary critical pressure, Weber assumes that the in- 
duced flow is accelerated until it becomes choked, the secondary 
throat occurring precisely in the plane of minimum shroud-area, 
where the pressure is assumed uniform. The former condition 
is similar to that of the so-called "supersonic flow pattern" of the
aerodynamic theory worked out at ONERA [2, 3], to which the 
author refers in his bibliography; this pattern, however, implies 
a supersonic primary jet, and a constant area mixing tube. One 
may seriously question whether the induced flow can be ac- 
celerated to sonic velocity by a permanently subsonic primary 
flow (even if the divergence between total temperatures is ac- 
counted for), or, if the former assumption is taken for granted, 
whether the secondary throat coincides with the minimum shroud 
area. 

The linear trend of experimental data suggests that a single 
flow pattern must be prevailing for every one of the nozzle shroud 
contours probed; these data are shown against characteristic 
curves calculated [4] with the assumption of a supersonic flow pat- 
tern corresponding to values of the mixing tube to primary nozzle 
exit area ratio equal to the actual values of A,/A,. Although 
this theoretical scheme differs notably from the actual geometry 
and cannot apply to an accurate description of experimental data, 
these appear to lie rather close to the quasi-linear characteristic 
curves so devised, except for case no. 2. 

As regards the over-all thrust coefficient calculation, the as- 
sumption of isentropic expansion for both the secondary and 
primary cores, still unmixed at the nozzle-shroud exit, is in direct 
relation with the theoretical analysis of the afore-mentioned 
study, as regards the supersonic flow pattern. The close correla- 
tion of experimental data for nozzle thrust coefficient, internal 
and wall friction losses being accounted for by means of the 
accurate procedure devised by the author, appears as another 
confirmation of the theoretical analysis worked out at ONERA.


Author’s Closure


The author wishes to thank Messrs. J. Fabri and E. Le Grives 
for their constructive discussion of the paper. The statements 
regarding the thrust coefficient calculation were much appreciated 
since no other check on the procedure utilized in the paper for 
calculation of thrust coefficient has been obtained. 

It is true that, if the secondary flow is made to converge as it 
enters the shroud and if much of it enters at an angle to the axial 
direction, the secondary flow will be far from one-dimensional. 
In some General Electric Company experiments static pressure 
taps were located on the external surface of the primary nozzle 
near its exit and on the converging portion of the shroud. The 
pressure on the outside of the primary nozzle exit indicated nearly 
secondary total pressure for secondary flows tested (less than 
about 30 per cent of the primary flow). The shroud taps read 
lower pressures along all points of the shroud, both converging 
and diverging. Experimental data on primary flow coefficients 
under ejector operation were obtained from nine different
configurations similar to those shown in Fig. 1 of the paper. The
results of these experiments were compared with the theory as
given by Equation (4) of the text where, 


A,*/A, = (A,*/A MA pretn/A,) = 


As indicated after Equation (4), Apcerm/A,* is evaluated at the 
primary nozzle exit plane static to total pressure ratio. The 
theoretical line shown in Fig. 12 is obtained by assuming 
that the primary nozzle exit plane static pressure is equal to the 
secondary total pressure as indicated by pressure tap data. Only 
two of the configurations are shown, but all of the other nozzles 
gave similar results when compared with the theory. The choked 
flow coefficient C, (when the secondary total pressure is less than 
primary critical pressure) must be known or obtained from 
experimental data. The break points in the curves shown in 
Fig. 12 correspond to unchoking of the primary nozzle by the 
secondary flow or to P,, = 0.528 P,,, primary critical pressure. 

Referring to case 2 of the discussion by Fabri and Le Grives, 
we note that if P,, < P,,, the primary nozzle will reach sonic 
velocity before the secondary does. This implies that initially 
downstream of the primary nozzle exit both streams are subsonic 
and must converge. After reaching sonic velocity, the primary 
will diverge while the secondary continues to converge. All of 
this behavior is assumed to occur between the primary exit and 
the minimum shroud diameter. The axial distances involved are 
small and one-dimensional flow is again questionable. However, 
the gross behavior as discussed here may still be the same al- 
though the secondary throat may occur downstream of the mini- 
mum shroud diameter. Shroud pressure measurements indicate 
that the throat of the secondary does move downstream as the 
secondary flow becomes nearly zero and the shroud becomes 
cylindrical.


Pressure Losses in Smooth Pipe Bends


The results of extensive experimental studies to determine the pressure
losses for turbulent flow in smooth pipe bends of circular cross section
are presented in this paper. To make the data usable in practical design
problems, the results are discussed in relation to those found by
previous investigators, and empirical formulas for the bend-loss
coefficient are given. The general correlation of the test data appears
to be as good as our present test information will permit.


Introduction 


THE PROBLEM OF DETERMINING the pressure loss
produced by a bend has been the subject of research for many
years, because of its considerable engineering importance in the
design and analysis of fluid machinery and piping systems.
However, although many investigators have studied the problem, the
results which they have obtained have not been correlated
satisfactorily. Published formulas [1-4] for the bend-loss coefficient
have given results which are in considerable quantitative
disagreement. It is evident that a more detailed study is necessary
before the problem is placed on a certain and systematic basis.
Recent work of Pigott [4] is welcomed, as he furnished a more 
nearly rational and consistent method of computing loss of head 
due to bends. However, the scatter of the data in his formulation 
appears to be too large to be attributed to the experimental 
errors. 

The work discussed in this paper was undertaken to obtain in- 
formation which would be of some help with the correlation of 
previous results and, further, to furnish modern and more ac- 
curate data for engineering use. The paper contains, in con- 
densed form, considerable information on the effect of relative 
radius, Reynolds number, and deflection angle on bend resistance. 
An attempt to organize and rationalize the bend-loss coefficient 
is given. 

The author does not claim to offer a new empirical formula for 
computing loss of head due to bends which may be used under all 
circumstances, since bend resistance varies with the different 
velocity distribution in the approach pipe, or with the proximity 
of another fitting in the upstream or in the downstream tangent, 
and so on. The author’s aim is to furnish an empirical formula 
which will surely be of some help with the correlation of the 
bend-loss coefficient, and further, to draw attention to the
importance of the pressure losses that take place in the downstream
tangent which are apt to be overlooked. Although the experi- 
ments to be discussed are limited to the case of smooth pipe 
bends, the results doubtless will give closer definition to the 
present vague outlines of the laws of bend resistance. 


Experimental Apparatus 


The pipe bends used in the tests were brass castings, made up 
of two pieces in the plane of the curved pipe axis. Their inner 
surfaces were machined and polished carefully with fine sand 
paper to accurate geometrical forms. The bends tested are shown 
in Fig. 1(a). In addition to these brass bends in split castings,
experiments also were conducted for two commercial screw elbows
of 45 and 90 deg. Their nominal size was 1 1/4 in. The details of
the fit-up of their screwed ends are shown in Fig. 1(b). The 
dimensions of the bends and elbows are found in Table 1. The 
straight pipes, or tangents, connected with those bends were of 
smooth, drawn-copper tubing, of 35 mm ID, specially selected for 
straightness and uniformity of their bores. Their friction factors 
were in satisfactory agreement with published values [5-7] for 
smooth pipes. 

A typical arrangement of experimental apparatus is illustrated 
in Fig. 2. The water, led from a surge tank or from a constant- 
level tank, after passing through a honeycomb flow straightener 
and a bellmouth entrance, enters the pipe under test. About 


Table 1 Dimensions of pipe bends which were tested 
Pipe Deflection Radius of Relative 
bend angle, 4, curvature, R, radius, 

deg-min ID, 2r, mm mm R/r 
44-53 34.90 68 
45-13 34.96 46 
44-6 34.97 60 
SO-48 35.01 2.01 
80-3 90 
89-30 < 65 

178-24 34.67 
179-36 34.74 




9 (screw 

elbow) 45-0 
10 (screw 

elbow) 90-0 


2r, internal diameter of pipe 

friction factor for a curved pipe 
= friction factor for a straight pipe 

acceleration of gravity 


gated disturbance, as defined by 
equation (2) 

bend-loss coefficient measured at 
exit of bend, as defined by equa- 


radius of center line of bend 
vd/v, Reynolds number 

mean axial velocity 
dimensionless number defined 


loss of head 

total bend-loss coefficient defined by 
equation (1) 

bend-loss coefficient measured 
within the region of the propa-
tion (3)
axial length of center line of bend 
length of upstream tangent 
length of downstream tangent 
radius of circular cross section of 
pipe 


equation (9) 
numerical factor determined 
equations (6) to (8) 
deflection angle of bend, radians 
deflection angle of bend, deg 
kinematic viscosity of fluid

Fig. 1(a) Pipe bends which were tested


115 diameters were used for a calming length between the bell- 
mouth and the first piezometer tap. All the experimental pipes 
were connected by brass couplings, the details of which are shown 
in Fig. 2. Each manometer connection at the pipes consisted of 
four holes, 0.7 mm diam, drilled at 90-deg intervals in the circum- 
ference. Sufficient care was taken to scrape off the burrs so that 
the inside edge of each hole may be flush with the surface. The 
discharge was measured by the triangular weir with a vertex angle
of 90 deg for high flows, and by the orifice meter near the exit for
the lower flows. Both of them had been calibrated accurately
before the experiments.


Fig. 1(b) 1 1/4-in. screw elbows which were tested (dimensions marked
on the two elbows: R = 28.4 mm, D = 45.4 mm, d = 34.9 mm; and
R = 50.4 mm, D = 42.2 mm, d = 34.9 mm)


Fig. 2 Experimental apparatus: (a) honeycomb flow straightener,
(b) 3-in. pipe, (c) bellmouth entrance, (d) upstream tangent
(drawn-copper tubing), (e) pipe bend under test, (f) downstream tangent
(drawn-copper tubing), (g) 2 1/2-in. pipe, × piezometers. Relative
lengths at the piezometers: l_u/d = 7.6, 15.0, 22.2, 30.8;
l_d/d = 9.3, 17.6, 27.6, 41.4, 54.3, 61.6


Fig. 3 Pressure distribution along pipeline containing a 90-deg pipe bend (R/r = 3.7)


Experimental Results 


Pressure Drop in the Pipeline Containing a Bend. A typical example 
of the curves of the measured pressure distribution along a bend 
including tangents is shown in Fig. 3, where the experimental 
values of $h/(v^2/2g)$, h being the pressure head on the pipe wall, are
plotted against the axial length of the pipe expressed in diameters. 
There is a marked increase in pressure along the outer wall of the 
bend, accompanied by a corresponding decrease in pressure along 
the inner wall. The distorted flow condition persists for some 
distance downstream, and in most cases the distance of about 50 
diameters is necessary before the pressure gradient downstream 
of the bend becomes constant and presumably the same as that of 
the upstream tangent. 

Loss Coefficients for Smooth Pipe Bends. As shown in Fig. 3, when 
a fluid is forced through a pipe bend, a considerable portion of 
the energy is lost in the downstream tangent. This relation is 
shown schematically in Fig. 4, where A-B-C-D-E represents the 
actual hydraulic gradients, and A-B’-C’-D’-E’ the hydraulic 
gradient in a corresponding straight pipe. If the pipe bend were 
removed, the hydraulic gradient would be represented by A-B’- 
C’-D’-E”. Thus the total head loss chargeable to the pipe bend
becomes $\Delta h_t$. The total bend-loss coefficient is given by


$$k_t = \frac{\Delta h_t}{v^2/2g} \qquad (1)$$


If the terminal piezometer lies within the region of the disturb- 
ance, as point D in Fig. 4, the bend-loss coefficient measured 
at this point will be given by 


$$k = \frac{\Delta h}{v^2/2g} \qquad (2)$$


In particular, the bend-loss coefficient measured at the exit of the
bend is


$$k_b = \frac{\Delta h_b}{v^2/2g} \qquad (3)$$


Definitions (1) to (3) do not exclude the imaginary quantity of pipe
friction corresponding to the physical length of the bend. This
convention, which accords with the recent opinion of Pigott [4] and
Beck [8], not only has the advantage of treating the pipe bend as
a complete entity, but also makes the bend resistance easier to
formulate.


Fig. 4 Diagram of a typical hydraulic gradient
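The loss-coefficient definitions all share one form: a head loss divided by the velocity head v²/2g, with the Reynolds number formed as vd/ν from the mean axial velocity, the internal diameter and the kinematic viscosity. A minimal sketch of that arithmetic follows; the function names and the numerical inputs in the usage lines are hypothetical illustration values, not measurements from the tests.

```python
STANDARD_GRAVITY = 9.80665  # m/s^2, assumed value of g


def velocity_head(v: float, g: float = STANDARD_GRAVITY) -> float:
    """Velocity head v^2/(2g) in metres for a mean axial velocity v in m/s."""
    return v * v / (2.0 * g)


def bend_loss_coefficient(delta_h: float, v: float,
                          g: float = STANDARD_GRAVITY) -> float:
    """Head loss chargeable to the bend (m) divided by the velocity head."""
    return delta_h / velocity_head(v, g)


def reynolds_number(v: float, d: float, nu: float) -> float:
    """Reynolds number vd/nu based on the internal pipe diameter d."""
    return v * d / nu


if __name__ == "__main__":
    v = 2.0          # m/s, illustrative mean velocity
    d = 0.035        # m, the 35 mm tubing bore
    nu = 1.0e-6      # m^2/s, assumed kinematic viscosity of water
    delta_h = 0.05   # m, illustrative head loss
    print(f"Re = {reynolds_number(v, d, nu):.3g}")
    print(f"k  = {bend_loss_coefficient(delta_h, v):.4f}")
```

Whether the result is the total coefficient or one of the partial coefficients depends only on where the terminal piezometer sits relative to the disturbed region downstream of the bend.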

The measured bend-loss coefficients as defined by equations (1) 
to (3) are shown plotted against the Reynolds number in Fig. 5. 
The bend-loss coefficients within the region of the disturbance 
were measured at 9 and 18 diameters downstream from the bend. 
In every case the increase in resistance due to the bend was
undetectable beyond 50 pipe diameters downstream from the
bend. In Fig. 5 the available empirical formulas [1] to [4] for the
bend-loss coefficient are compared with the experimental
results. It is apparent that none of the published formulas gives
satisfactory agreement with the experimental results over the range
of the tests.

Besides these small-radius pipe bends in split castings, 10 long- 
radius pipe bends of smooth drawn-copper tubing, with radius 
ratio ranging from 10.4 to 216, and with deflection angle from 45 
to 180 deg were tested. Omitting the details of the experiments, 
which are available in reference [9], we present the total bend-
loss coefficients in Fig. 6. In the case of these long-radius pipe
bends, the increase in resistance in the downstream tangent was
relatively small, and most of the losses were observed to take
place between the entry and the exit of the bend.

Fig. 6 Total bend-loss coefficients for long-radius pipe bends

Fig. 7 Variation of total bend-loss coefficient with relative radius for
smooth pipe bends. Full line, Equation (5) or (11).

Fig. 8 Variation of total bend-loss coefficient with deflection angle for
smooth pipe bends

Effect of Relative Radius. Fig. 7 illustrates the variation of the 
total bend-loss coefficients with the relative radius for a Reynolds 
number of $2 \times 10^5$. In Fig. 7 the results obtained by Hofmann
[10] and Wasielewski [11] for smooth pipe bends are included for
comparison. A minimum value of k, occurs at an R/r of about 5 
for 45 and 90-deg bends, and about 4 for 180-deg bends. 

Effect of Deflection Angle. Except for long-radius pipe bends, the
total bend-loss coefficient is not proportional to the deflection
angle for a given value of R/r. The effect of deflection angle is
illustrated in Fig. 8 for a Reynolds number of $2 \times 10^5$.

Loss Coefficients for Screw Elbows. The sections of the commer- 
cial screw elbows of Japanese industrial specification shown in 
Fig. 1(b) are larger than the pipes which they connect. The 
eddies produced by such enlargement and contraction of section 
cause losses of head in addition to the losses of head mentioned in 
the foregoing. The measured loss coefficients for 45 and 90-deg 
screw elbows are shown in Fig. 9. As expected, although their 
inner surfaces were carefully polished and plated with cadmium 
so as to avoid the effect of surface roughness on bend-loss
coefficients, the measured loss coefficients are considerably larger
than the loss coefficients in the case of smooth pipe bends.


Empirical Formulas for Total Bend-Loss Coefficient 


Formulation of Total Bend-Loss Coefficient With Nondimensional
Number $\mathrm{Re}(r/R)^2$. The total bend-loss coefficients $k_t$, divided by
$\theta(R/r)^{1/2}$, were plotted in Fig. 10 as a function of the
nondimensional number $\mathrm{Re}(r/R)^2$. In Fig. 10 the experimental results of
the previous investigators [10-15] for smooth pipe bends were
included for comparison. It appears that two distinctive groups of
the bend-loss coefficients may be classified as follows: (a) The
group in which all the experimental results conform satisfactorily
to a single curve regardless of the deflection angle and the relative
radius. The bend loss arises mainly from the wall friction, and
the bend-loss coefficient is proportional to the deflection angle.
Pipe bends of larger relative radii belong to this group. (b) The
group in which the experimental results fall above the correlation
curve mentioned, and no consistent trend is observed. Pipe
bends of smaller deflection angles and smaller relative radii
belong to this group.

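The following empirical formula was deduced from the experimental 
results shown in Fig. 10, where the Reynolds numbers tested covered 
the range $2 \times 10^4 < \mathrm{Re} < 4 \times 10^5$: 

For $\mathrm{Re}\,(r/R)^2 < 91$ 

$$k_t = 0.00873\,\alpha f_c\,\theta\,\frac{R}{r} \tag{4}$$

For $\mathrm{Re}\,(r/R)^2 > 91$ 

$$k_t = 0.00241\,\alpha\,\theta\,\mathrm{Re}^{-0.17}\left(\frac{R}{r}\right)^{0.84} \tag{5}$$

where $f_c$ is the friction factor for turbulent flow in curved pipes 
[16], $\theta$ is the deflection angle in degrees, and $\alpha$ is a numerical 
coefficient, its values being shown in Fig. 11. For pipe bends of large 
relative radius, $\alpha$ is equal to unity. Approximate expressions for 
$\alpha$ are: 

For $\theta$ = 45 deg 

$$\alpha = 1 + 14.2\,(R/r)^{-1.47} \tag{6}$$

For $\theta$ = 90 deg 

$$\alpha = \begin{cases} 0.95 + 17.2\,(R/r)^{-1.96}, & R/r < 19.7 \\ 1, & R/r > 19.7 \end{cases} \tag{7}$$

For $\theta$ = 180 deg 

$$\alpha = 1 + 116\,(R/r)^{-4.52} \tag{8}$$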
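The explicit branch of the formula can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it takes equation (5) to read $k_t = 0.00241\,\alpha\,\theta\,\mathrm{Re}^{-0.17}(R/r)^{0.84}$ with $\theta$ in degrees, and the exponents in equations (6) to (8) to be -1.47, -1.96, and -4.52. The function names `alpha` and `k_total` are hypothetical. The branch for $\mathrm{Re}\,(r/R)^2 < 91$ is deliberately left out because it needs the curved-pipe friction factor $f_c$ from reference [16].

```python
def alpha(theta_deg: float, rel_radius: float) -> float:
    """Numerical coefficient alpha of equations (6)-(8); rel_radius is R/r.

    The exponents are assumed values (see the lead-in), not verified data.
    """
    if theta_deg == 45:
        return 1.0 + 14.2 * rel_radius ** -1.47
    if theta_deg == 90:
        if rel_radius < 19.7:
            return 0.95 + 17.2 * rel_radius ** -1.96
        return 1.0
    if theta_deg == 180:
        return 1.0 + 116.0 * rel_radius ** -4.52
    raise ValueError("alpha is given only for 45, 90, and 180 deg")


def k_total(theta_deg: float, rel_radius: float, reynolds: float) -> float:
    """Total bend-loss coefficient from equation (5), valid for Re (r/R)^2 > 91."""
    if reynolds / rel_radius ** 2 <= 91.0:
        raise ValueError("equation (4) applies here and needs f_c from [16]")
    return (0.00241 * alpha(theta_deg, rel_radius) * theta_deg
            * reynolds ** -0.17 * rel_radius ** 0.84)


# Example: 90-deg bend of relative radius 3.7 at Re = 2e5
k_90 = k_total(90, 3.7, 2.0e5)
```

With these assumed coefficients the 90-deg expression for $\alpha$ joins the value of unity almost continuously at R/r = 19.7, which is a useful sanity check on the reading of equation (7).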


Formulation of Total Bend-loss Coefficient With Nondimensional 
Number ¥%r/@)'/*.. The experimental results were plotted in Fig. 12 
as a function of the nondimensional number Y*(r/R)'/* where Y 
is defined by the equation 


= R(R/r)'/* (9) 


It is noteworthy that a similar relation to that in Fig. 10 holds in 
this case. The following empirical formula was deduced from 
the experimental results shown in Fig. 12: 

For ¥%(r/R)'/* < 9.4 


k, = 0.00873af,0 


For ¥2(r/R)'* > 9.4 


0.925 
ky = Of "40a0Y ( 


(11) 


In the range of tests, the numerical values of @ are practically the 
same as those found previously. To facilitate the calculations of 
k,, Fig. 16 is given in the Appendix from which Y can be read for 
any value of R(R/r)'’*. 

The total bend-loss coefficients computed from these equations 
were compared with the experimental results in Figs. 5 and 7. 
The proposed formulas are in good agreement with experiments. 
The only exception is the 90-deg bend of R/r = 2, where a 
difference is observed for the smaller Reynolds numbers. In general, 
the difference between equations (5) and (11) is hardly 
perceptible in the figures. In the range of tests the author prefers 
equation (5) to equation (11), since in the former $k_t$ is 
given explicitly as a function of the Reynolds number. 

Losses in Screw Elbows. In the case of the commercial screw 
elbows shown in Fig. 1(b), a large increase in resistance due to the 
enlargement and contraction of section was observed. However, 
in the present experiments, it appears that adding the 
loss coefficients for sudden expansion and contraction [17] to the 
proposed empirical equation gives a rough estimate of the loss 
coefficients. The results are shown in Fig. 9. 


Losses in Downstream Tangent 


Bend Loss Measured at Exit of Bend. The ratio of the loss co- 
efficient k, measured at the exit of the bend to the total bend-loss 
coefficient is shown in Fig. 13, where the experimental results of 
Alexander [18] and Richter [2] are included for comparison. The 
velocity traverse in a pipe bend [14] shows that the velocity dis- 
tribution may be symmetrical when the flow is entering a bend, 
but when coming out it is not. Hence it should be emphasized 
that the measured results in Fig. 13 express not the loss of energy 
head, but the pressure drop measured at the wall of the pipe. 

Bend Loss Measured Within Region of Propagated Disturbance. 
The bend-loss coefficients k measured at various distances down- 
stream are shown in Fig. 14, where the ratio of k to the total bend- 
loss coefficient is plotted against the distance expressed in pipe 
diameters. While the distorted flow condition persists for about 
50 pipe diameters downstream, most of the loss produced by a 
bend takes place within 30 diameters of the downstream tangent. 


Effect of Upstream Tangent 


The effect of the entrance region was studied for a 90-deg bend 
of relative radius 3.7. Preliminary pressure-drop measurements 
with a long straight pipe connected at the exit of the bellmouth 
directly had shown that the formulas for the entrance region of a 
pipe [19, 20] hold well in this case. Effect of the length of the up- 
stream tangent on bend resistance is illustrated in Fig. 15, where 
the length of the upstream tangent was varied from 0 to 20 di- 
ameters. The ordinate K is defined by the equation 


h f + la) Mh, 
(v?/2g) d (v?/29) 
where h is the total loss of head from the exit of the bellmouth to 
the point under consideration, and Ah, is the usual additional 
losses in the entrance region of a straight pipe. In this experiment 
about 20 diameters of the upstream tangent were necessary for 


the bend-loss coefficient to coincide with that given by equation 
(5) or (11). 

while Alexander's results are at 3.2 diameters downstream from bend 

APPENDIX 
Formulation of Total Bend-Loss Coefficient With the 
Nondimensional Number $\mathrm{Re}\,(r/R)^2$ 


When a fluid is forced through a pipe bend, a secondary flow is 
produced as a result of centrifugal force, and this combines 
with the axial velocity to form a double spiral flow. Recent 
detailed investigations [16] on fully developed turbulent flow in 
curved pipes show that a remarkable correlation exists between 
the frictional resistance and the curvature effect. Now, in the 
case of a long-radius pipe bend, the bend-loss coefficient may be 
expressed approximately in terms of the friction factor for curved 
pipes as 

$$k_t = f_c\,\frac{l_c}{2r} \tag{13}$$

changes of the pattern of flow along the axis of the pipe being 
disregarded. Since $l_c = R\Theta$, equation (13) may be expressed in 
an alternative form 

$$\frac{k_t}{\Theta (R/r)^{1/2}} = \frac{1}{2}\, f_c \left(\frac{R}{r}\right)^{1/2} \tag{14}$$

Fig. 13(b) Bend-loss coefficients measured at exit of 45-deg and 180-deg 

Fig. 16 Y as a function of $\mathrm{Re}\,(R/r)^{1/2}$ 


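For moderate Reynolds numbers where the 1/7-power velocity 
distribution holds in the shedding layer in the regions near the 
walls, equation (14) may be put in the form [16]: 

For $6 > \mathrm{Re}\,(r/R)^2 > 0.034$ 

$$\frac{k_t}{\Theta (R/r)^{1/2}} = 0.0145 + 0.152\,[\mathrm{Re}\,(r/R)^2]^{-0.25} \tag{15}$$

For $\mathrm{Re}\,(r/R)^2 > 6$ 

$$\frac{k_t}{\Theta (R/r)^{1/2}} = \frac{0.158}{[\mathrm{Re}\,(r/R)^2]^{0.2}} \tag{16}$$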


According to the results of measurements shown in Fig. 10, 
equations (15) and (16) hold with sufficient accuracy for long-radius 
pipe bends for values of $\mathrm{Re}\,(r/R)^2$ less than 91. For values of 
$\mathrm{Re}\,(r/R)^2$ greater than 91, the following empirical equation was 
deduced from the experimental results: 

$$\frac{k_t}{\Theta (R/r)^{1/2}} = \frac{0.138}{[\mathrm{Re}\,(r/R)^2]^{0.17}} \tag{17}$$
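A short numerical check shows how the two correlations fit together at the changeover value of 91. The sketch below assumes equation (16) reads $0.158/[\mathrm{Re}\,(r/R)^2]^{0.2}$ and equation (17) reads $0.138/[\mathrm{Re}\,(r/R)^2]^{0.17}$. Both exponents are assumptions, and the names `eq16` and `eq17` are hypothetical. It also converts the coefficient 0.138, which goes with the angle in radians, to the angle in degrees.

```python
import math


def eq16(x: float) -> float:
    """k_t / (Theta (R/r)^(1/2)) for long-radius bends; x = Re (r/R)^2 > 6."""
    return 0.158 / x ** 0.2


def eq17(x: float) -> float:
    """Empirical form assumed for x = Re (r/R)^2 > 91."""
    return 0.138 / x ** 0.17


x_switch = 91.0
mismatch = abs(eq16(x_switch) - eq17(x_switch))  # difference at the changeover

# Coefficient of the same relation when the deflection angle is in degrees
coeff_degrees = 0.138 * math.pi / 180.0
```

Under these assumptions the two expressions agree closely at 91, and the converted coefficient rounds to 0.00241.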


In the case of smaller relative radius, where the eddy loss 
created by the separation of flow may not be ignored, or in the 
case of smaller deflection angle, where the secondary flow may 
not be fully developed before the flow reaches the second 
transition to the downstream straight pipe, the measured results 
deviate progressively upward from the correlation curve. 
Unfortunately, at present, no reliable theory on bend resistance 
applicable to this case is available. To get a simpler formula for 
engineering use, it is assumed that the loss coefficients in this case 
are $\alpha$ times as large as that predicted by equation (14) or (17), 
$\alpha$ being the numerical factor determined by the experiments. As 
seen by the fine dotted lines in Fig. 10, this approximation seems 
to give fairly good agreement with experiments in the range of 
tests. Thus we have approximately: 

For $\mathrm{Re}\,(r/R)^2 < 91$ 

$$\frac{k_t}{\Theta (R/r)^{1/2}} = \frac{1}{2}\,\alpha f_c \left(\frac{R}{r}\right)^{1/2} \tag{18}$$

For $\mathrm{Re}\,(r/R)^2 > 91$ 

$$\frac{k_t}{\Theta (R/r)^{1/2}} = \frac{0.138\,\alpha}{[\mathrm{Re}\,(r/R)^2]^{0.17}} \tag{19}$$

The values of the numerical factor $\alpha$ determined by the experiments 
are shown in Fig. 11. For long-radius pipe bends $\alpha$ reduces to unity, and 
the foregoing equations coincide with equation (14) or (17), 
respectively. Replacing $\Theta$ in (18) and (19) with $\theta$, and rewriting, 
we obtain equation (4) or (5). 


Formulation of Total Bend-Loss Coefficient With 
Nondimensional Number ¥°(r/R)': 


The friction factor for fully developed turbulent flow in a 
smooth curved pipe is expressed alternatively by the semi- 
empirical equation deduced from the logarithmic velocity-dis- 
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tribution law [16]. In this case the friction factor f, in equation 
(14) is expressed approximately in terms of the nondimensional 
number ¥Y%r/R)'/*, where Y is defined by equation (9). Fig. 16 
gives numerical values of Y as a function of R(R/r)'/*, Equa- 
tion (14) may then be put in the form: 
For ¥*(r/R)‘/? < 5.3 
k, 
= 0.0162 + 
+ 0.8[¥%(r/R)*] (20) 
For ¥%(r/R)'* > 5.3 
ky 0593 
¥%(r/R)'/* 


(21) 


According to the results of measurements shown in Fig. 12, the 
equations hold with sufficient accuracy for large-radius pipe 
bends for values of Y*(r/R)‘/* less than 9.4. For values of 
Y%(r/R)'/* greater than 9.4, the following empirical equation 
was deduced from the experimental results: 


ke 0.424 
(22) 


For small deflection angles and small values of R/r, measured 
results are expressed within the scatter of the experimental data 
by the following empirical equations: 

For ¥*(r/R)'/* < 9.4 


k, 
For ¥%(r/R)‘/* > 9.4 
ky 0.424a 
O(R/r)'* 


The numerical factor $\alpha$ determined by the experiments 
practically coincides with that in the previous case. Replacing $\Theta$ 
with $\theta$, and rewriting, we obtain equations (10) and (11). 


DISCUSSION 
Cyrus Beck² 


The author is to be commended for the care exercised both in the 
selection of the ranges of Reynolds numbers and relative radii 
and in the prosecution of his experiments. In the flow regimes 
explored, his results are the most comprehensive published by any 
one investigator. The empirical formulas derived for total 
bend-loss coefficients will be useful to designers of piping systems 
involving bends in the ranges of Reynolds number, relative radius, 
and deflection angle examined. 

The hydraulic gradient in Fig. 4 gives clear and proper defini- 
tion to the pressure loss distribution, particularly in the down- 
stream tangent. The writer is gratified that Professor Ito took 
the bold step and plotted his coefficients as total bend-loss, a 
procedure also recommended by Pigott. A further salutary 
change would consist of presenting friction loss in terms of equiva- 
lent length in diameters of straight pipe of characterized rough- 
ness. This might lead from the labyrinth of loss coefficients. 

Since the bends studied divide themselves naturally into large 
relative radii and small relative radii, small deflection angle, the 
two types may be discussed separately. 

In bends of large relative curvature, the relation between total 
bend-loss coefficient, curved pipe friction factor, and bend length 
is interesting for a reason that is implied but which is worth point- 
ing out explicitly. Equation (13) can be written 


² Supervisor, Automatic Controls Branch, Aeronautical Instruments 
Laboratory, Naval Air Development Center, Johnsville, Pa. 

Since R/r is the ratio of inertial force to centrifugal force, the 
inference may be drawn that once a pipe has assumed a large 
relative curvature, it makes no distinction between the total change 
in flow direction, $\theta$, and the ratio of inertial to centrifugal force, 
$F_i/F_c$. The mechanism of such indifference must be remarkable 
and certainly pertinent to the development of a reliable rational 
theory of bend flow resistance. 

Referring to Fig. 17, it will be observed that three dashed lines 
have been superimposed on Fig. 7. These lines represent the 
total loss coefficients, $k_s$, for straight pipe of length equal to that 
of the corresponding bend. The general relation for straight pipe 
is 

$$k_s = f\,\frac{l}{d} \tag{26}$$

where l = length of straight pipe. 
Since for comparison the straight pipe length must equal the 
axial length of the bend, 

$$\frac{l}{d} = \frac{\theta}{2}\,\frac{R}{r} \tag{27}$$

The average of the friction factor values given by Blasius and 
Nikuradse for smooth pipe is 0.016 for Reynolds number $2 \times 10^5$, 
which is constant for Fig. 7. Hence, 

$$k_s = 0.008\,\theta\,\frac{R}{r} \tag{28}$$

from which the dashed lines in Fig. 17 were drawn. For the R/r 
range plotted, it is clear that $k_s$ is always less than $k_t$. 
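The straight-pipe comparison lines are easy to reproduce. The sketch below assumes Equation (28) reads $k_s = 0.008\,\theta\,R/r$, that is, $k_s = (f/2)\,\theta\,(R/r)$ with f = 0.016 and $\theta$ in radians. The function name `k_straight` is hypothetical.

```python
import math


def k_straight(theta_rad: float, rel_radius: float,
               friction_factor: float = 0.016) -> float:
    """Loss coefficient of a straight pipe as long as the bend axis.

    Assumed form: k_s = (f / 2) * theta * (R / r), theta in radians.
    """
    return 0.5 * friction_factor * theta_rad * rel_radius


# One value per deflection angle studied, at an illustrative R/r of 3.7
for theta_deg in (45, 90, 180):
    k_s = k_straight(math.radians(theta_deg), 3.7)
    print(f"theta = {theta_deg:3d} deg, R/r = 3.7: k_s = {k_s:.4f}")
```

Because $k_s$ is linear in both $\theta$ and R/r, each dashed line has a slope of 1 on logarithmic axes.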

As indicated by the curves in Fig. 7 and by Equations (13) 
and (14), the author has ascribed no upper limit to R/r for the 
Reynolds number concerned. Two cogent questions suggest 
themselves. Is it justifiable to extrapolate the curves in Fig. 7 
and to utilize Equations (13) and (14) to higher values of R/r 
indefinitely? Does $k_t$ approach $k_s$ for relative radii above the 
R/r range investigated? 

To focus the matter and for the sake of discussion, the usual 
intuitive assumption is made. As R/r approaches infinity, the 
friction factor for curved pipe, $f_c$, approaches the friction factor 
for straight pipe, f. Conventionally, then, we may write 

$$\lim_{R/r \to \infty} f_c = f, \qquad \theta\ \text{constant} \tag{29}$$

Fig. 17 Variation of total loss coefficient $k_s$ for straight pipe compared with $k_t$ curves of Fig. 7 

For a straight pipe of length equal to the bend, 


6 constant (30) 


In view of Equation (13), Equation (30) becomes 

$$\lim_{R/r \to \infty} k_t = k_s, \qquad \theta\ \text{constant} \tag{31}$$

Equation (31) states that, as R/r becomes very large, $k_t$ 
approaches and does not exceed $k_s$ in value. Fig. 17 does not 
reveal such a tendency nor do Equations (13) and (14) admit such 
a possibility since $f_c$ is a constant independent of R/r. 

The premise upon which Equation (29) is based influenced 
Pigott, an experienced observer, (reference [4] ) to postulate con- 
tinuous decreasing of net bend-loss coefficient with increasing 
R/d, in spite of the conflicting evidence reported by Hofmann, 
Vogel, Beij, Busher, and Kittredge and Crowley. For the same 
reason, Professor Ito (reference [9]) rejected the bend-loss 
formula for 90-deg bends deduced by Pigott (reference [3] ). 
Similar logic applied to the author’s current results would impose 
a finite R/r upper limit on the validity of Equations (13) and 
(14) as well as on the reliability of extrapolating the curves in 
Fig. 7. 

Since the author’s work has been so admirably conceived and 
executed, to question its accuracy would be to grasp at straws. 
Nor is it reasonable to challenge the import of the collective re- 
sults of an imposing array of independent experimenters. If it is 
conceded the results of these investigators establish that the trend 
of net bend-loss coefficients, in the range examined, is not toward 
zero, and Equations (29) and (31) are simultaneously valid as 
limiting conditions, then somewhere between the relative radii 
studied and infinite relative radius, new flow phenomena, as yet 
unobserved, must reveal themselves. Of these inferred effects, no 
one knows anything. Furthermore, the acceptance of the thus 
deduced existence of new phenomena in such a situation is 
intuitively unsatisfactory. One is compelled to imagine, somewhat 
reluctantly, the bend flow modifying its character at some unique 
esoterically high R/r to permit the transition of $f_c$ to f. 

Fortunately, some fundamental but perhaps elusive philosophical 
or mathematical concepts may be invoked to illuminate and 
ultimately to resolve the seeming incongruities. 

Further scrutiny discloses Equations (29), (30), and (31) are 
incorrectly formulated. Equation (29) itself contains an inherent 
difficult-to-detect contradiction, though the premise on which it 
is based has probably been accepted by most competent ob- 
servers. 

The reasoning in support of the writer’s contention is as follows. 
Every physical bend has a deflection angle $\theta$, that is, the upstream 
tangent makes the angle $\theta$ with the downstream tangent. This 
means the two tangents can never lie on the same straight line, 
regardless of the magnitude of R/r. In fact, the existence of 
deflection angle $\theta$ implies inexorably that R/r cannot become 
infinitely large, even though one says let it do so. Stated 
differently, it is physically impossible to increase R/r sufficiently 
to induce a bend to assume the shape of a straight pipe so long as 
$\theta$ is larger than 0. Hence, while R/r for a bend of deflection angle 
$\theta$ can be made larger than any preassigned finite value, it cannot 
be made larger than all preassignable finite values. Only a 
straight pipe has an R/r magnitude larger than all preassignable 
finite values. Some may consider the distinction hairsplitting or 
irritatingly subtle but it has a direct bearing on the matter. 
Since for every bend there is a deflection angle $\theta$, and since for 
constant $\theta > 0$ the relative radius R/r cannot become infinite, $f_c$, 
the friction factor for curved pipe, can never equal f, the friction 
factor for straight pipe. Hence, the assumption on which 
Equation (29) is based is specious. 

Does this mean then that a bend can never be continuously 
deformed into a straight pipe? Not at all. Anyone who has used 
a rubber garden hose has certainly pulled a curled section into 
a straight one. The difference here is that the deflection angle is 
not being held constant. As R/r approaches infinity, $\theta$ approaches 
zero, the length $l_c$ of the curved section remaining essentially 
constant. That this is true may be readily established by pursuing 
the matter as follows. 

The deflection angle is given by 

$$\theta = \frac{l_c}{R} \tag{32}$$

If $\theta$ is specified to be constant, then the bend length must become 
infinite for the relative radius to become infinite. It is easy to 
make R/r infinite and exactly so. Nobody, however, can make 
$l_c$ infinite, even approximately so. Since, of necessity, $l_c$ is finite, 
$\theta$ must vanish as R/r becomes infinite. Equation (32) gives 

$$\lim_{R/r \to \infty} \theta = 0 \tag{33}$$

which reveals that $\theta$ cannot be maintained constant until the 
infinite R/r limit is reached. 

It is apparent $l_c$ is simply the value of the indeterminate form 
that results from going to the limit. This is easily seen from 

$$\lim_{\substack{R/r \to \infty \\ \theta \to 0}} R\,\theta = l_c, \qquad l_c\ \text{constant} \tag{34}$$

Equations (29), (30), and (31) properly formulated become 

$$\lim_{R/r \to \infty} f_c = f, \qquad l_c\ \text{constant} \tag{35}$$
Limit f. 5 f a l, constant (36) 


$$\lim_{R/r \to \infty} k_t = k_s, \qquad l_c\ \text{constant} \tag{37}$$

This suggests that plotting bend-loss coefficients against $l_c$ or for 
constant parametric values of $l_c$ may have unsuspected 
advantages. Bend loss curves plotted in this manner will reveal 
that, as $R/r \to \infty$, $k_t \to k_s$ and $f_c \to f$. 

In view of the development presented, the premise leading to 
Equation (29) should not be utilized as a criterion to establish 
the validity of bend-loss coefficient relationships. Equations (13) 
and (14) could properly represent the total bend-loss coefficient 
for R/r conceivably as large as desired. The fact that points ob- 
tained by Hofmann and Wasielewski for smooth bends lie on the 
curves in Fig. 7 gives them powerful additional support. 

In Fig. 8, the curves for low relative curvature are particularly 
interesting. At a relative radius of 3.7, it appears that adding a 
90-deg bend to another 90-deg bend results in very little 
additional bend loss. The difference in $k_t$ for 180 deg and 90 deg is 
0.02 while 0.046 is computed from the friction factor of smooth 
straight pipe. This is not surprising since it is known that two 
contiguous valves, bends, or fittings in turbulent flow cause less 
pressure loss than the sum of their losses determined separately. 
An intriguing aspect of this situation lies in the contingency that 
this curve becomes asymptotic to a line representing constant $k_t$. 
If this were the case, pipe bends of relative radius 3.7 could be 
added contiguously indefinitely without producing any additional 
pressure drop. A consequence of this "lossless curved pipe line" 
is that the pressure drop for bends beyond a particular deflection 
angle would be less than the pressure drop in an equal length of 
straight pipe. Nature, of course, has contrived the practical 
difficulty of constructing a bend of such small relative radius to the 
large deflection angles undoubtedly required. 
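The figure of 0.046 quoted above can be checked by hand. The following worked line assumes the straight-pipe relation of Equation (28) reads $k_s = 0.008\,\theta\,R/r$ with $\theta$ in radians (f = 0.016), and applies it to the extra 90 deg of arc at R/r = 3.7:

```latex
\[
k_s \;=\; 0.008 \times \frac{\pi}{2} \times 3.7 \;\approx\; 0.0465
\]
```

On that assumption the additional 90 deg of bend adds roughly half the loss that the same length of smooth straight pipe would, which is the comparison drawn in the paragraph above.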

Fig. 7 shows minimum values of $k_t$ for bends of all three 
deflection angles investigated. The centrifugal force plays an 
analogous role with respect to relative curvature as the viscous 
force does with regard to Reynolds number. At low values of 
R/r in a bend and of Reynolds number in straight pipe, the 
centrifugal force and the viscous force respectively dominate the 
flow. At high values of R/r in bends and of Reynolds number in 
straight pipe, the inertia force determines the character of the 
flow. As Professor Ito has shown, the friction factor for curved 
pipe becomes essentially independent of centrifugal force for large 
R/r. As is known, the friction factor becomes relatively free of the 
viscous force for large values of the Reynolds number. 

In closing, the writer begs Professor Ito’s forbearance for 
digressing to a consideration of bend friction loss formulation in a 
discussion purporting to comment on his excellent paper. The 
very substantial experimental results presented really speak for 
themselves. 


Author’s Closure 


The author wishes to thank Mr. Beck for his interesting and 
valuable comments. 

Mr. Beck’s comments touch upon the interesting problem of 
the behavior of the total bend-loss coefficients for higher values 
of R/r. Unfortunately, experimental information for higher 
values of R/r is quite unsatisfactory. The only experimental 
information available is that reported by Keulegan and Beij 
(reference [15]) and v. Lewinski (reference [13]). Referring to 
Fig. 18, it is seen that the curve for the 90-deg bend in Fig. 7 was 
extrapolated to higher values of R/r to include the experimental 
results of these authors. In the figure the experimental points for 
higher values of R/r were plotted on the assumption that $k_t$ 
is proportional to the deflection angle, an assumption which holds 
good for higher values of R/r. In Fig. 18 the dashed line 
represents the total loss coefficient, $k_s$, for straight pipe of length equal 
to that of the corresponding bend. It is seen that the 
experimental points approach the dashed line representing $k_s$ for higher 
values of R/r. 

Fig. 18 Variation of total bend-loss coefficient with relative radius for 90-deg bends: 
1) Equations (18) and (19), 2) Equations (23) and (24), 3) total loss coefficient $k_s$ for 
straight pipe of the same axial length as bend 

Fig. 18 also includes two $k_t$ curves calculated by equations 
(18)³ and (19), and by equations (23) and (24), respectively. 
From the comparison of the calculated and experimental results, 
it is apparent that equation (13) may be used to extrapolate the 
curve in Fig. 7 to higher values of R/r such as shown in Fig. 18. 
A small discrepancy between the $k_t$ curve calculated by equation 
(23) and experimental points for higher values of R/r exceeding 
10° may be attributed to the somewhat rough empirical formula- 
tion of equation (20) for such higher values of R/r 

As Mr. Beck has stated, when the deflection angle $\theta$ is held 
constant, the two tangents can never lie on the same straight 
line regardless of the magnitude of R/r, and at a given R/r, $k_t$ 
must be larger than $k_s$ since some additional losses must be 
caused before the flow has left the bend after its turn of the angle 
$\theta$, say, 90 deg. That is 

$$k_t = k_s + \epsilon \tag{38}$$

where $\epsilon > 0$. However, by the inspection of Fig. 18, it is clear 
that while $k_s$ increases in proportion to R/r, $\epsilon$ remains finite, and 
hence we shall have 

$$\frac{\epsilon}{k_s} \to 0 \quad \text{for higher values of } R/r \tag{39}$$

It follows that 

$$\frac{k_t}{k_s} \to 1 \quad \text{for higher values of } R/r$$

by virtue of equation (38). 

³ For $\mathrm{Re}\,(r/R)^2 < 0.034$, $f_c = f$ (reference [16]). The author 
regrets that this relation was not mentioned in the Appendix. 

Mr. Beck states that two contiguous valves, bends, or fittings 
in turbulent flow cause less pressure loss than the sum of their 
losses determined separately. A number of pressure loss meas- 
urements on compound elbows have been carried out recently 
in the hydraulic laboratory of the Institute of High Speed Me- 
chanics, Tohoku University. The experiments and conclusions on 
compound elbows will be published in the Memoirs of the 
Institute of High Speed Mechanics shortly. 


Discharge Coefficients and Steady-State 
Flow Forces for Hydraulic Poppet Valves 


Experiments were run on a poppet valve operating in hydraulic oil. 
The experimental values of the flow forces and discharge coefficients 
were about 25 per cent below the predicted theoretical values. 
Although there was some scatter in the values of discharge 
coefficients, there was good correlation for the flow forces. 

Flow forces are strongly influenced by the downstream configuration. 
The smaller the diameter of the downstream chamber, the higher the 
forces. 

A poppet configuration was designed and tested which virtually 
eliminated the flow forces. 


A POPPET VALVE is a seating-type valve in which the 
moving element or poppet, usually spherical or conical in shape, 
moves in a direction perpendicular to its seat. Because of the 
several advantages that are associated with poppet valves such 
as ease of manufacture, minimum leakage, and insensitivity to 
clogging by dirt particles, poppets have been used for years as 
pressure regulators and relief valves. Fig. 1 shows a typical pres- 
sure regulator. 

The operation of this type of valve is quite simple. The fluid 
pressure counterbalances the spring force and allows fluid to 
escape through the annular passage way between the poppet and 
the seat. An ideal regulator should have a static flow versus pres- 
sure curve made up of two straight lines as shown by curve A in 
Fig. 2. An actual relief valve most likely would have a pressure- 
flow curve similar to curve B, which may be unacceptable in cer- 
tain applications. 

The recent interest in using poppets in high-performance sys- 
tems such as hydraulic servo valves has necessitated a better 
understanding of the factors which influence its performance 
characteristics. Although considerable work has been done in 
studying sliding-type valves [1, 2, 3, 8, 9, 10, 15],¹ the difficulty 
of separating "fluid" parameters from "geometry" parameters 
has prevented similar efforts in the case of poppet valves. Almost 
all the work dealing with poppets so far has been empirical in 
nature except for the contribution of von Mises [15] who studied 
a general valve configuration using an idealized incompressible 
fluid. 

¹ Numbers in brackets designate References at end of paper. 

Among the many aspects that are not well understood today 
are the discharge coefficients, the induced flow forces, the effects 
of various geometrical dimensions, the causes of self-excited 
vibrations and the pressure-flow characteristics. 


Theoretical Analysis 


For small openings, the flow through the poppet valve shown 
in Fig. 1 may be assumed to be an orifice-type flow (the flow being 
proportional to the square root of the pressure drop). Using 
conventional notation, 

$$Q = C_d A \sqrt{\frac{2 P_v}{\rho}} \tag{1}$$

where 

$Q$ = volumetric flow rate, cu in./sec 
$C_d$ = orifice-discharge coefficient 
$A$ = orifice area, sq in. 
$P_v$ = pressure drop across valve, psi 
$\rho$ = fluid density, lb-sec²/in.⁴ 

The velocity at the vena contracta is 

$$V = C_v \sqrt{\frac{2 P_v}{\rho}} \tag{2}$$

where $C_v$ is the velocity coefficient. 
It is to be pointed out here that the following study will be 
limited to a sharp-edge valve (S = 0, Fig. 1). Brief mention will 
be made of the qualitative effects of changing the seat width S. 

Nomenclature 

$A$ = valve area 
$A_e$ = exhaust-chamber area 
$A_t$ = area of top of poppet 
$A_i$ = inlet area 
$C_d$ = discharge coefficient defined by $C_d = (Q/A)(\rho/2P_v)^{1/2}$ 
$C_v$ = velocity coefficient defined by $C_v = V(\rho/2P_v)^{1/2}$ 
$D$ = valve diameter 
$D_e$ = equivalent diameter defined by 4 (cross-sectional area/wetted perimeter) 
$F$ = force on poppet 
$F_d$ = extra closing force on poppet 
$F_i$ = force on valve interface 
$F_p$ = pressure force on poppet 
$F_s$ = shear force on poppet 
$h$ = valve opening perpendicular to flow 
$P$ = pressure 
$P_e$ = exhaust pressure 
$P_i$ = inlet pressure 
$P_j$ = pressure under exhaust jets 
$P_t$ = pressure across top of poppet 
$P_v$ = $P_i - P_e$ = pressure drop across valve 
$Q$ = volume rate of flow 
$r$ = radial clearance between piston and valve 
$R_e$ = Reynolds number 
$S$ = valve-seat width 
$V_{in}$ = inlet velocity 
$V_{out}$ = exit velocity 
$X$ = valve opening in direction of motion 
$\alpha$ = valve angle between vertical and poppet face 
$\pi$ = 3.14159 
$\rho$ = density of oil 
There are two types of forces which act on the poppet face. 
One is normal to the surface and the other is tangential. The normal 
force arises from the pressure distribution across the poppet 
surface and the tangential force is viscous in nature. The following 
discussion assumes that the pressure force seen by the closed 
poppet is balanced out. Means of balancing out this force 
will be discussed later. From this point on, the poppet force will 
refer to the net force on the poppet less the pressure force on the 
closed poppet. 

Ideal Poppet with Ideal Fluid. This is the case of a poppet valve 
having a sharp-edged seat and using incompressible and inviscid 
fluid exhausting into an infinite chamber. Writing the 
momentum equation for the control volume shown in Fig. 3 while 
ignoring the pressure force, we get 

$$F = \text{momentum out} - \text{momentum in} = \rho Q\,(V_{out}\cos\alpha - V_{in}) \tag{3}$$

Since for most practical cases the valve opening is small in 
comparison to the valve diameter, $V_{in}$ can be neglected when 
compared to $V_{out}$. The direction of this force F is such that it tends 
to close the poppet. 
For very small openings, the valve area is 

$$A = \pi D X \sin\alpha \tag{4}$$

[Fig. 2: pressure-flow curves; axes labeled PRESSURE and FLOW] 

Combining Equations (1), (2), (3), and (4), the force F can be 
expressed as follows: 

$$F = X\,(\pi P_v D\, C_d C_v \sin 2\alpha) \tag{5}$$

This force can be considered as a spring force with a spring 
constant of $\pi P_v D\, C_d C_v \sin 2\alpha$. This equation can be rearranged 
to a dimensionless form as in 

$$\frac{F}{\pi P_v D X} = C_d C_v \sin 2\alpha \tag{6}$$

Alternately Equation (6) can be expressed 

$$\frac{F}{Q \sqrt{P_v}} = C_v\,(2\rho)^{1/2} \cos\alpha \tag{7}$$

For an incompressible and inviscid fluid von Mises evaluated 
$C_d$ for a general valve configuration. His results for a conical 
poppet valve having a small opening are plotted in Fig. 4. 
The plot for $F/(Q\sqrt{P_v})$, which is not shown, would be a cosine 
wave. For further discussion of the equations, the reader is 
referred to references [16, 15, 11]. 
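The chain from Equations (1) to (7) can be checked numerically. The sketch below is illustrative only and rests on assumptions: that Equations (1) and (2) read $Q = C_d A (2P_v/\rho)^{1/2}$ and $V = C_v (2P_v/\rho)^{1/2}$, that the valve area is $\pi D X \sin\alpha$, and that $V_{in}$ is neglected. The function names and the operating point are hypothetical and are not taken from the experiments.

```python
import math


def valve_area(d, x, alpha):
    """Equation (4): flow area for a very small opening."""
    return math.pi * d * x * math.sin(alpha)


def flow_rate(c_d, area, p_v, rho):
    """Equation (1): orifice-type flow."""
    return c_d * area * math.sqrt(2.0 * p_v / rho)


def jet_velocity(c_v, p_v, rho):
    """Equation (2): velocity at the vena contracta."""
    return c_v * math.sqrt(2.0 * p_v / rho)


def force_momentum(rho, q, v_out, alpha):
    """Equation (3) with the inlet velocity neglected."""
    return rho * q * v_out * math.cos(alpha)


def force_closed_form(p_v, d, x, c_d, c_v, alpha):
    """Equation (5): flow force as an equivalent spring force."""
    return x * math.pi * p_v * d * c_d * c_v * math.sin(2.0 * alpha)


# Hypothetical operating point in inch, pound, second units
D, X = 0.5, 0.005              # valve diameter and opening, in.
ALPHA = math.radians(45.0)     # valve angle
P_V = 1000.0                   # pressure drop, psi
RHO = 8.0e-5                   # oil density, lb-sec^2/in.^4 (assumed)
C_D, C_V = 0.7, 0.98           # assumed coefficients

area = valve_area(D, X, ALPHA)
q = flow_rate(C_D, area, P_V, RHO)
f_momentum = force_momentum(RHO, q, jet_velocity(C_V, P_V, RHO), ALPHA)
f_closed = force_closed_form(P_V, D, X, C_D, C_V, ALPHA)

dimensionless_6 = f_closed / (math.pi * P_V * D * X)   # left side of (6)
dimensionless_7 = f_closed / (q * math.sqrt(P_V))      # left side of (7)
```

The momentum form and the closed form agree because $2 \sin\alpha \cos\alpha = \sin 2\alpha$. The density cancels in Equation (5) but survives in Equation (7).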

Ideal Poppet With Finite Interface. If the seat is parallel to the 
poppet, then the fluid is unable to follow the sharp bend, labeled 
A in Fig. 5, and it separates from the corner. The pressure along 
the seat is the same as that in the exhaust chamber and the original 
analysis holds. The same is true if the seat angle is greater than 
the poppet angle so that the valve contacts the point labeled A. 
If the seat angle is less than the valve angle, so that the contact 
is made at B, then the pressure along the seat varies from the 
upstream pressure to the downstream pressure. This pressure 
tends to open the closed valve. If point A is rounded then there 
may be a pressure along the seat tending to open the valve. 

Fig. 4 Flow force and discharge coefficient versus poppet angle 
(calculated from von Mises theory) 

Ideal Poppet With Real Fluid Exhausting Into Air. When a poppet
exhausts into an infinite chamber filled with air there are viscous
drag forces both internal and external to the control volume.
The only external force of any consequence acts in the
high-pressure chamber along the valve body. As this force opposes
the fluid motion, it tends to close the valve. The viscous forces
internal to the control volume decrease the outgoing momentum
and thus the closing force. However, as there is always some
momentum leaving the control volume, there is always a closing
force on the poppet.

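The most important parameter affecting $C_d$ and $C_v$ is Reynolds
number. Based upon an equivalent hydraulic diameter, the
poppet Reynolds number is

$$R = \frac{\rho}{\mu}\, C_v \sqrt{\frac{2P_0}{\rho}}\; D_h
    = \frac{\rho}{\mu}\, C_v \sqrt{\frac{2P_0}{\rho}}
      \left(\frac{4\pi D X \sin\alpha}{2\pi D}\right)
    = \frac{2\rho Q}{\pi D \mu}\left(\frac{C_v}{C_d}\right) \tag{8}$$

where $D_h$ is the equivalent hydraulic diameter. Because this is
an awkward formulation, the $C_v/C_d$ term will be dropped and
Reynolds number defined as

$$R = \frac{2\rho Q}{\pi D \mu} \tag{9}$$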

Ideal Poppet With Real Fluid Exhausting Into a Real Fluid. Consider
the control volume of Fig. 6 where the dashed line is the control
volume and the solid line is the fluid jet leaving the poppet and
striking the exhaust-chamber walls. $A_e$ is the exhaust-chamber
area, $A_p$ the area of the top of the poppet, $A_v$ the valve area.
The exhaust pressure is $P_e$, the pressure across the top of the
poppet $P_p$, and the pressure under the fluid jet $P_j$. If the
control volume is drawn far enough downstream then fluid friction and
turbulence will have dissipated the jet so that the fluid will exit
with uniform velocity and pressure across the control volume.
Velocity gradients between the moving and the stagnant fluid
introduce forces which tend to close the valve as they oppose the
fluid motion. Tidor [14] has shown that there is a large amount of
turbulence present. This turbulence creates shear forces which
may either open or close the valve. The turbulent fluid within the
exhaust chamber strikes both the top and the underside of the
poppet and the resulting impact of the fluid may tend either to
close or to open the valve. Although the general problem would be
difficult to solve, a rough approximation can be made from the
given control volume. If it is assumed that $P_j$ is constant under
the fluid jet and that $P_p$ is constant across the top of the poppet
and equal to $P_e$, then, neglecting the shear forces on the control
volume as well as the height of the exhaust fluid column for a
vertical poppet, the pressure forces acting on the poppet are
$P_e A_v + P_j(A_e - A_v)$ across the bottom and $P_e A_e$ across the
top. If $A_e$ is large in comparison to the valve opening, the fluid
will leave with negligible momentum. Equating the pressure forces
with the entering fluid axial momentum force F as given by
Equation (5) leads to

$$F = P_e A_e - P_j(A_e - A_v) - P_e A_v = (P_e - P_j)(A_e - A_v) \tag{10}$$

But because F is positive $P_e$ must be greater than $P_j$ so that
there is an extra closing force on the valve equal to

$$F_c = (P_e - P_j)(A_p - A_v) \tag{11}$$

Combining these two equations leads to

$$F_c/F = (A_p - A_v)/(A_e - A_v) \tag{12}$$

If $P_e$ is atmospheric pressure, then $P_j$ must be less than
atmospheric, and air dissolved in the fluid may come out of
solution. The increase in closing force, low pressure under the fluid
jet, and the release of dissolved air have all been observed
experimentally, suggesting that the foregoing equation gives a
qualitative representation of what is happening.
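The area ratio of Equation (12) is easily evaluated. The following sketch computes the extra closing force as a fraction of the momentum force for two exhaust-chamber sizes. The argument names and the sample diameters are hypothetical, chosen only to show the trend that a smaller chamber gives a larger extra closing force.

```python
import math


def circle_area(diameter):
    """Area of a circle of the given diameter."""
    return math.pi * diameter ** 2 / 4.0


def extra_closing_force_ratio(chamber_area, poppet_top_area, valve_area):
    """Extra closing force divided by momentum force:
    (poppet top area - valve area) / (chamber area - valve area)."""
    if chamber_area <= valve_area:
        raise ValueError("chamber area must exceed valve area")
    return (poppet_top_area - valve_area) / (chamber_area - valve_area)


if __name__ == "__main__":
    # Hypothetical geometry: 1-in. valve, 1.5-in. poppet top.
    valve = circle_area(1.0)
    top = circle_area(1.5)
    for chamber_diameter in (3.5, 1.75):
        ratio = extra_closing_force_ratio(circle_area(chamber_diameter), top, valve)
        print(f"{chamber_diameter}-in. chamber: F_c/F = {ratio:.3f}")
```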

Effect of a Finite Interface With a Real Fluid. If $S \neq 0$ there
is a drag force along the poppet interface which tends to open the
poppet; however, the pressure acting along this face gives a much
larger opening force. Assuming that the flow is laminar, then
equating the shear force and the pressure drop across the passage
leads to the shear force $F_s$

$$2F_s = \pi D h P_0 \tag{13}$$

In laminar flow, the pressure drop is uniform along the passage
so that the pressure force $F_p$ is

$$F_p = \pi D S (P_0/2) \tag{14}$$

However, for laminar flow S is at least ten times as great as h so
that $F_s$ can be neglected when compared to $F_p$. Resolving this
force in the axial direction leads to an opening force of

$$F = F_p \sin\alpha = \frac{\pi D P_0}{2}\, S \sin\alpha \tag{15}$$
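For a sense of magnitude, the opening force contributed by a finite interface can be evaluated as follows. The sketch assumes the pressure force on the interface is pi * D * S * (P0 / 2), as in Equation (14), resolved axially through sin(alpha). The function names and the sample values are hypothetical.

```python
import math


def interface_pressure_force(d, s, p0):
    """Pressure force on the interface, assuming F_p = pi * D * S * (P0 / 2)."""
    return math.pi * d * s * p0 / 2.0


def interface_opening_force(d, s, p0, alpha_deg):
    """Axial (opening) component of the interface pressure force."""
    return interface_pressure_force(d, s, p0) * math.sin(math.radians(alpha_deg))


if __name__ == "__main__":
    # Hypothetical case: 1-in. seat, 0.02-in. interface, 1000-psi drop, 45 deg.
    force = interface_opening_force(1.0, 0.02, 1000.0, 45.0)
    print(f"opening force from interface = {force:.1f} lb")
```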
Pressure Forces on the Closed Poppet. Consider the closed poppet
shown in Fig. 7 where $F_p$ is the force which acts along the valve
seat. If $P_i$ is the inlet pressure and $P_e$ the exhaust pressure,
the force tending to open the valve is

$$F = \frac{\pi}{4} D^2 P_i - \frac{\pi}{4}(D + 2S\sin\alpha)^2 P_e + F_p \sin\alpha \tag{16}$$

$F_p$ is the product of the seat area times a pressure which is
between $P_i$ and $P_e$. If the seat angle is greater than the valve
angle, then the poppet seats against point A of Fig. 5 and the
pressure on the interface is $P_e$. If the seat angle is less than
the valve angle then the pressure along the interface is $P_i$. If
the two faces are parallel, it is assumed that the pressure drop will
be linear so that

$$F_p = \pi S (D + S\sin\alpha)\,\frac{P_i + P_e}{2} \tag{17}$$

Combining Equations (16) and (17) and noting that $S \ll D$ leads to

$$F = \frac{\pi}{4}(D + S\sin\alpha)^2 (P_i - P_e) = \frac{\pi P_0}{4}(D + S\sin\alpha)^2 \tag{18}$$
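The size of the approximation made in passing to Equation (18) can be gauged with a short calculation. The sketch below is an illustration under stated assumptions, not the author's derivation: it takes the inlet pressure to act on the bore of diameter D, the exhaust pressure to act on the full projected area of diameter D + 2S sin(alpha), and the seat pressure to fall linearly between the two. All names and sample values are hypothetical.

```python
import math


def opening_force_exact(d, s, alpha_deg, p_in, p_ex):
    """Net opening force with a linear pressure drop along a parallel seat."""
    offset = s * math.sin(math.radians(alpha_deg))
    outer = d + 2.0 * offset
    # Axial projection of the conical seat area.
    seat_projected = math.pi / 4.0 * (outer ** 2 - d ** 2)
    seat_axial_force = 0.5 * (p_in + p_ex) * seat_projected
    return (math.pi / 4.0 * d ** 2 * p_in
            - math.pi / 4.0 * outer ** 2 * p_ex
            + seat_axial_force)


def opening_force_approx(d, s, alpha_deg, p_in, p_ex):
    """Approximate form for S << D: (pi/4) * (D + S*sin(alpha))**2 * (P_in - P_ex)."""
    offset = s * math.sin(math.radians(alpha_deg))
    return math.pi / 4.0 * (d + offset) ** 2 * (p_in - p_ex)


if __name__ == "__main__":
    exact = opening_force_exact(1.0, 0.02, 45.0, 1000.0, 0.0)
    approx = opening_force_approx(1.0, 0.02, 45.0, 1000.0, 0.0)
    print(f"exact = {exact:.2f} lb, approximate = {approx:.2f} lb")
```

Under these assumptions the two forms differ only by a term of order (S sin(alpha))^2, so the approximate form is adequate when the interface is short compared with the seat diameter.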


Methods of Force Compensation. Usually the pressure force is 
balanced out by an opposing force. The most common method 
of balancing the force is to attach a spring, preloaded to the value 
given by Equation (18), to one side of the valve. The dis- 
advantage of this method is that the spring force increases with 
opening giving the rising pressure flow curve of Fig. 2. Another 
method is to attach a piston to the high-pressure side of the
poppet. If the piston has the same diameter (D + S sin α for best
results) as the valve, then the high-pressure fluid acting on equal
areas but in opposite directions cancels out. This method has
several disadvantages. It is expensive to manufacture because of 
close tolerances and the piston introduces a force which tends to 
open the valve. Consider the control volume of Fig. 8. If the 
radial clearance r is small the leakage flow past the piston is 
laminar. Because laminar flow has a parabolic velocity profile,
there is no shear force in the center of the passage. However, as
the control volume no longer follows the wall, there is a pressure
force tending to open the valve equal to


r 


F = 2 


F ( \ ( r ) 

2)\x 
Another method of balancing out forces, where friction is a 
major consideration, is shown in Fig. 9. The high pressure acts 
on both an upstream and downstream side of the valve. The low 
pressure does the same so that the net force on the valve is zero. 
The major disadvantage of this configuration is the difficulty of 
aligning the poppets and having them both seat simultaneously. 
The momentum force may be canceled out also. Considera- 
tion of Equation (6) shows that the momentum force is zero 
when a = 90 deg. A mushroom-shaped poppet has the desired 

shape and has proved successful. 


(19) 
or 


(20) 


Experimental Apparatus 


A schematic representation of the experimental apparatus is 
shown in Fig. 10. The principle of operation is as follows. Fluid 
enters in between the poppet and the piston and flows past the 
poppet, thus creating an opening force. The pressure force on 
the closed poppet is balanced out by the piston, and a relatively 
low-pressure fluid is introduced into the chamber below the piston 
to balance the flow force. The piston and poppet assembly can be 
maintained in a position of static equilibrium by regulating the 
pressure below the piston. 

The poppet is removable from the piston so that various poppet 
configurations may be tested. The seat is 1 in. in diameter and
the radial clearance between the piston and the valve body is 
less than 10~* in. A large solenoid attached to the side of the 
apparatus would shake the assembly and provide “dither’’ for 
reducing the static frictional effects between the piston and the 
cylinder. The valve was reground between runs in order to 
maintain as sharp an edge as possible. All runs were carried 
out with the apparatus in a vertical position and covered by 
a 12-in-diameter plastic cylinder to retain the splashing fluid and 
permit visual inspection. 

Instrumentation and Accuracy. The flow was measured by a 2000- 
psi, 0-10 gpm flowmeter calibrated to 0.05 gpm and was inde- 
pendent of pressure and temperature. 


Fig. 10 Piping diagram (labeled components: accumulator, test stand,
test apparatus)


MARCH 1960 / 14] 



All pressures were measured with Bourdon tube gages, dead-weight
calibrated to 1 per cent of full scale. The estimated
maximum pressure loss between the high-pressure gage and
high-pressure chamber was 6 psi so that over-all accuracy for the
high pressure was within 25 psi. The values of F in the results
have been corrected for piston weight, poppet weight, height of
fluid-exhaust column if the poppet exhausted into a finite chamber,
and difference in height between pressure gages and the
bottom of the piston. Estimated over-all accuracy for the flow
force is 2 per cent.


The opening was measured by a linear differential transformer.
The slug was attached to an unstressed portion of the piston and
the coil was attached to the base. Vacuum-tube voltmeters
measured the voltages and were calibrated to within 1 per cent.
Measurements were taken only after steady-state temperature
was reached. Change of winding resistance with temperature
introduced another error of 1 per cent. The largest error in
determining the opening was the determination of the “zero” position.
Because the poppets leaked, even when pushed against the seat,
the zero position was found by extrapolating the readings of
flow versus opening to zero flow. With this method, it was
estimated that the opening was determined to within 0.7 × 10~‘
in. At full opening, the over-all accuracy is 4 per cent.
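The zero-position extrapolation described above can be sketched as a straight-line fit. The paper does not state how the extrapolation was carried out; the least-squares fit below is one plausible way, assuming flow varies linearly with the transformer reading near closure. The function name and the sample readings are hypothetical.

```python
def zero_position(readings, flows):
    """Fit flow = intercept + slope * reading by least squares and
    return the reading at which the fitted flow is zero."""
    n = len(readings)
    if n < 2 or n != len(flows):
        raise ValueError("need at least two matched readings")
    mean_x = sum(readings) / n
    mean_y = sum(flows) / n
    sxx = sum((x - mean_x) ** 2 for x in readings)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(readings, flows))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -intercept / slope


if __name__ == "__main__":
    # Hypothetical transformer readings (volts) and flows (gpm).
    volts = [0.40, 0.60, 0.80, 1.00]
    gpm = [0.55, 1.48, 2.52, 3.49]
    print(f"zero position at {zero_position(volts, gpm):.3f} V")
```

The opening at any other reading is then taken as the calibrated displacement from this extrapolated zero.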

The fluid temperature was measured by a thermometer placed 
in the inlet lines. The accuracy was ±5 deg F.

Determination of the Variables. The flow force was found by
multiplying the corrected pressure by the piston area, and then
subtracting the piston and poppet weight. $C_d X$ was found by
combining Equations (1) and (4), which reduce to

$$C_d X = \left(\frac{\text{const}}{D}\right)\frac{Q}{\sqrt{P_0}} \tag{21}$$

The compensated poppet weighed '/,. lb and all of the others
weighed 7/16 lb. All of the tabulated data, except for the
compensated poppet, list this corrected force. The data for the
compensated poppet also were corrected by means of Equation (19)
to provide a better comparison between different pressures. The
estimated accuracy on the flow force was 5 per cent for large
openings. The oil used was a mixture of Univis 40 and J43. R
is determined from Equation (9) using an average value of the
properties of the two oils.

Fig. 11(b) Discharge coefficient versus flow (45-deg poppet at 140 F)
Fig. 11(d) Discharge coefficient versus flow (45-deg poppet with 1.75-
Fig. 11(f) Discharge coefficient versus flow (30-deg poppet)
Fig. 11(g) Discharge coefficient versus flow (45-deg compensated
poppet)
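The reduction of flow and pressure readings to the product of discharge coefficient and opening can be sketched as below. Equations (1) and (4) are not reproduced in this part of the paper, so the sketch assumes the usual orifice relation for the annular poppet opening, Q = Cd * pi * D * X * sin(alpha) * sqrt(2 * P0 / rho). The function name, the units, and the sample values are hypothetical.

```python
import math


def cd_times_opening(q, p0, d, alpha_deg, rho):
    """Return Cd * X, assuming Q = Cd * pi * D * X * sin(alpha) * sqrt(2*P0/rho).

    q   : volume flow (in.^3/sec)
    p0  : pressure drop (lb/in.^2)
    d   : seat diameter (in.)
    rho : mass density (lb-sec^2/in.^4)
    """
    jet_velocity = math.sqrt(2.0 * p0 / rho)
    return q / (math.pi * d * math.sin(math.radians(alpha_deg)) * jet_velocity)


if __name__ == "__main__":
    # Hypothetical reading: 5 gpm at a 1000-psi drop, 45-deg poppet, 1-in. seat.
    q = 5.0 * 231.0 / 60.0   # 1 U.S. gallon = 231 in.^3
    rho = 8.0e-5             # typical order of magnitude for a hydraulic oil
    cdx = cd_times_opening(q, 1000.0, 1.0, 45.0, rho)
    print(f"Cd * X = {cdx:.5f} in.")
```

Dividing this product by the measured opening X gives the discharge coefficient, so the uncertainty in the zero position carries over directly to the coefficient at small openings.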


Experimental Results 


Description of Test Conditions. Tests were run on four different
poppet configurations, a 45-deg cone, a 45-deg cone compensated 
for flow forces, a 45-deg sphere, and a 30-deg cone. The 45-deg 
cone was run under five different conditions. Three of the runs 
were into an “‘infinite’’ exhaust chamber with varying oil tem- 
peratures and two runs were made with the poppet exhausting 
into a finite-diameter chamber. The other poppets all exhausted 
into infinite chambers. The data are plotted in Figs. 11-14. 
Each set of data, for a particular poppet and running condition, 
has been given a run number. With the exception of Run No. 3 
which was carried out at an oil temperature of 140 F, all other 
runs were made at 110 F. The data were usually reproducible 
within a few per cent. 

Experimental Values of $C_d$. The discharge coefficient $C_d$ is
plotted against flow and Reynolds number in Fig. 11. The scatter
in the data is most evident in Run No. 7 where the first data, taken
at 500 psi, agree with the value predicted by von Mises, but the
value for the rest of the run is 20 per cent lower. These data do
not agree with those of other experimenters [14]. The value of
$C_d$ for the 45-deg cone varied from run to run and, at large
openings, the average value was about 0.67, which is lower than the
predicted value of 0.747. The value of $C_d$ for the 45-deg sphere
was slightly lower at 0.62.


Experimental Values of F. The flow force is plotted
nondimensionally versus the opening in Fig. 12. For Run No. 4 the data
are plotted in several different ways in Fig. 13. The differences
in correlation are typical for the other runs. When the data are
plotted as in Figs. 13(a) and 13(c), the valve opening X does not
need to be determined. For Fig. 13(d), the flow does not need to
be determined.

Fig. 12(a) Flow force versus opening (45-deg poppet at 110 F)
Fig. 12(b) Flow force versus opening (45-deg poppet at 140 F)
Fig. 12(c) Flow force versus opening (45-deg poppet with 3.5-in. side
walls)
Fig. 14 Flow force versus temperature for 45-deg poppet at “constant”
opening and pressure. Reynolds number versus temperature.

The values of the force, at large openings, are within 20 per cent
of the value predicted by the von Mises theory. If the
experimentally determined values of $C_d$ are used in Equation (6), the
predicted forces are within 10 per cent of the experimentally
determined forces. The closing forces for the 45-deg sphere are 15
per cent higher than those for the 45-deg cone. For the 45-deg
poppet in the middle ranges of opening, the force is proportional
to the pressure times the opening to the three-halves power.

Run No. 8 was for the compensated poppet and the flow force
was virtually eliminated. The plot is kept dimensional because
of the small magnitudes of the force. Since the zero position is
only accurate to ±0.1 lb, nondimensionalization would lead to
meaningless curves. The data for this curve are corrected for the
opening force of the piston by Equation (19).

Run No. 9 shows the effects of oil temperature upon the flow
forces. There seems to be no correlation between temperature
change and $C_d$. These plots seem to indicate that the flow
forces are a strong function of temperature although they are not
as strong a function of Reynolds number. The forces increase
with increasing temperature and seem to level off at a high
temperature.

The forces for Runs No. 1 and 2 are nearly the same but their
discharge coefficients differ widely. If the force plots had been
made as $F/(P_0 D C_d X)$ instead of $F/(P_0 D X)$, then they also
would differ and the higher temperature would have the lower force.
This is inconsistent with the data of Run No. 9.


Flow Characteristics When Exhausting Into Air. For flow at low
momentum and large (0.1 in.) openings, the fluid leaves the
poppet in a solid sheet with the shape shown in Fig. 15.

Fig. 16

At small openings and low momentum the flow pattern is simi- 
lar to Fig. 16. The flow remained in a solid sheet until it reached 
the free edge labeled A where it broke up into discrete jets and 
bent until the jets were vertical. The flow continued to curl so 
that the fluid dropped back on top of the poppet. This “teapot
effect” has been noted by others [7]. As the momentum
increased, the jets straightened out until they left the poppet at
the same angle as the valve, except for the 45-deg sphere, where 
the final angle was between 35 and 40 deg. 

Flow Characteristics When Exhausting Into Oil. When the experi- 
mental poppet exhausted into a chamber filled with fluid, it 
tended to be unstable and oscillated at a frequency of 1300 cps. 
Although the period of oscillation seemed to be independent of 
exhaust-chamber height, the poppet did not oscillate unless 
the exhaust chamber was of some critical length. For the cham- 
ber lengths at which the poppet did not oscillate, there were 
certain regions of flow at which the valve was on the verge of 
instability but, at large flows, the valve was stable. The diame- 
ter of the chamber does not affect the frequency, although the 
valve seemed to be more unstable with the larger diameter 
chamber. Occasionally the valve oscillated at a low frequency 
of '/, cps, but this seemed to occur at random times.

As predicted by Equation (12), there was a low-pressure re- 
gion underneath the poppet and the dissolved air was released 
from the oil. The release of dissolved air seemed to be connected 
with valve stability. At very low flow, no air was released, but 
as the flow increased, air bubbles appeared in the fluid. These 
bubbles were mainly of two types, one under 1/32 in. in diameter
and the other about '/, in. in diameter. 

The small bubbles were very fine and numerous and took a 
long time either to redissolve or break up. The large bubbles 
were entirely different. The fluid around these bubbles was ex- 
tremely turbulent with a large amount of vorticity. When 
reaching the free surface, these bubbles often did not break up, 
but reversed their direction of motion and traveled counter to 
the gross fluid motion. Other bubbles of this type remained sus- 
pended in the middle of the chamber and oscillated vertically 
about a point. 

Just before the valve started to oscillate, the number of bubbles
of both types increased. The large bubbles were more prevalent
when the valve oscillated. Just as the valve started to oscillate,
the small bubbles could be seen coming off the poppet in a
continuous stream. The bubbles also increased in number when the
valve was on the verge of oscillating but did not oscillate.


Conclusions and Recommendations 


A better understanding of discharge coefficients is needed. Be- 
cause there was a lot of scatter in the data and they differed from 
those of other experimenters, the numerical results should be used 
judiciously. No reasonable explanation is offered for these dif- 
ferences. 

At low Reynolds number, the value of $C_d$ is small; it increases
toward unity at medium Reynolds number, and then de- 
creases toward the theoretical value at high Reynolds number. 
This is explained by noting that as Reynolds number increases, 
the passage appears first as a laminar restriction, then as a well- 
rounded orifice and finally as a sharp-edged orifice. 

The main discrepancy between the predicted and experimental 
value is probably due to friction in the fluid. Surface tension and 
rotation also may play a part. 

Although the flow forces were 20 per cent below the predicted
value, this “error” was reduced to about 10 per cent if
experimental values were used for $C_d$. This suggests that friction
can considerably alter the flow forces.

The formation of the jets is probably caused by surface tension 
although fluid rotation is also important at low momentum. The 
continual curving of the fluid jets is probably due to rotation. 

The downstream chamber configuration is important in deter- 
mining the poppet characteristics. The smaller the chamber, the 
higher the closing force. Although the turbulence in the chamber 
is very complex, Equation (12) gave an adequate representation 
for the poppet tested. 

The low pressure beneath the poppet also caused the release of
dissolved air. This phenomenon also seems to be related to valve 
stability, but it is not known if this is a cause or effect relation. 

A mushroom-shaped poppet was effective in reducing the flow 
forces. 

Recommendations for Further Work. A further study should be 
undertaken on discharge coefficients. The curves of this paper 
and of Tidor [14] suggest that there are several regimes of flow. 
Although it appears that this is a two-dimensional phenomenon 
three-dimensional effects should be studied. One of the most im- 
portant phenomena to be studied is the effect that the seat shape 
has upon the discharge coefficient. Eccentricity and cocking of 
the poppet may also be important. 

No real work can be done on flow forces until there is a better 
understanding of discharge coefficients. A better prediction of 
the momentum forces could be obtained by placing pressure taps 
along the surface of the poppet and integrating the resulting pres- 
sure field. Other variables which should be studied are viscosity, 
surface tension, fluid rotation, cavitation, and release of dissolved
air. 
Crossover Systems Between the Stages
of Centrifugal Compressors


GAYLORD O. ELLIS

Senior Engineer, Research Department,
Carrier Corporation, Syracuse, N. Y.


The location and orientation of vanes in an annulus are established on the basis of two
principles which result from a discussion of simple elbows. These principles are
incorporated into a procedure for designing the diffuser vanes in the crossover passage
between stages of a centrifugal compressor. To test the proposed design a low-speed
model was built and tested and the results compared with a second model which
represented a more conventional design.


1.0 Introduction 


In most centrifugal compressors nearly half the
energy imparted to the fluid leaving the impeller of a given stage 
is in the form of kinetic energy or velocity with meridional and 
tangential components. Before entering the next stage, the fol- 
lowing changes in the flow are necessary: 


(1) As much of the kinetic energy as possible must be converted 
to static pressure. 

(2) The flow direction must be changed from the tangential- 
radial direction to an axial direction. 

(3) The flow must be returned from the large diameter beyond 
the tip of the impeller of the given stage to the smaller diameter 
of the inlet to the subsequent stage. 


Contributed by the Fluid Mechanics Subcommittee of the Hydraulic
Division and presented at the Hydraulic Conference, Ann
Arbor, Mich., April 13-15, 1959, of The American Society of
Mechanical Engineers.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, December
15, 1958. Paper No. 59-Hyd-3.

The system designed to accomplish this task will be termed a
crossover system and in conventional centrifugal compressors
consists of all or part of the following components: (1) a vaneless
diffuser which partially diffuses the flow leaving the impeller;
(2) an annular passage which directs the flow from the exit of the
vaneless diffuser to the inlet of the subsequent stage; and (3) a set
of vanes somewhere in the system to recover the kinetic energy
of the velocity remaining in the fluid after passing through the
vaneless diffuser. The crossover system is of major importance
for the following reasons:


(1) The length of flow path is long and spacing between passage 
walls is small leading to high friction losses. 

(2) The operating range of a stage is usually established by the 
design of the crossover system. 

(3) The over-all diameter of a compressor with a given im- 
peller diameter is determined primarily by the crossover design. 


For a given impeller operating under fixed conditions there ap- 
pears little that can be done to change the efficiency of the vane- 
less diffuser with our present knowledge. Improvements can best 
be obtained by studying impeller configurations to determine 
those for which efficient diffusers can be designed. The losses in 
the annular passage between the exit of the vaneless diffuser and 
the inlet to the second stage result primarily from the sharp turns 
required in the meridional plane. Generally speaking, the radii 
of these turns are already as large in existing designs as the space 
limitations will allow and thus there is limited opportunity to 
improve the design of these annular passages. 

This leaves the vanes to be considered. Here there is
considerable freedom in the design since the vane geometry can be varied
in many ways and there is also a choice of location for the vanes.
Thus improvements in design should be possible and, furthermore,
if the efficiency of the vanes can be improved, less diffusion
may be required in the vaneless diffuser, which can materially
reduce the over-all compressor size. It is suggested that properly
designed and located vanes can assist in the turning of the fluid in
the meridional plane. A procedure for designing the crossover
vanes, which is the subject of this report, has been developed in
the Research Department of Carrier. A set of vanes has been
designed by this procedure and tested, the results of the test
being reported herein.

The design concepts of this report are applicable to the design
of any annular cascade of rotating or stationary vanes which
impart a change in the moment of momentum (or whirl) of a fluid
in a system where a change in meridional (axial-radial) flow
direction is also taking place.


Nomenclature

absolute velocity
meridional component of velocity
tangential component of velocity
radial component of velocity
axial component of velocity
gradient of pressures on surface normal to flow
component of pressure gradient normal to meridional streamline
tangential component of pressure gradient
h = distance between hub and shroud measured along normal to
    meridional streamline
distance ratio h/h_0, where the subscript indicates station 0-0 of
    Fig. 5
n = distance measured normal to meridional lines
r = radius
static pressure
total pressure
velocity head, ρc²/2g
radius ratio r/r_0
radius of curvature, positive when dα/ds is positive
blade thickness
s = distance measured along meridional streamline
z = axial distance
α = angle measured between line tangent to the meridional flow
    direction and the axis of the compressor (see Fig. 4)
β = angle between flow direction and meridional plane, measured on
    surface of revolution generated by meridional streamline
ratio of specific heats
blade blockage factor
angle between mean blade surface and meridional plane, measured
    on surface of revolution generated by normal to meridional
    streamlines
mass density
whirl (tangential component of velocity times radius)
angular displacement

Subscripts

H, M, S = values measured at hub, mean line, and shroud,
    respectively
s, p = values measured on suction and pressure surfaces of blade,
    respectively
i-i = value at stations shown on Fig. 5


2.0 Basic Features of the Design 


2.1 Location of Vanes 


Since the turning of a fluid introduces pressure gradients with 
concomitant losses, it is desirable to accomplish a required change 
in direction or displacement of flow with a minimum of turning. 
(This is true provided, of course, that the passage geometry with 
minimum turning does not produce higher velocity heads, more 
wetted surface, or more disruptive secondary flows.) For instance, 
if it is desired to design a channel between points A and B in Fig. 1, 
it is obviously better to use two 45-deg turns than two 90-deg 
turns. While this is a rather trivial example, it demonstrates a 
principle which is applied to more complicated problems. 

For instance, consider the case of a set of vanes in an annulus
which removes whirl and is followed by a 180-deg turn from a
radial outward to a radial inward direction as shown in Fig. 2(a).
Assuming that the flow enters the vanes at a flow angle of 75 deg
measured from the radial direction, the mean velocity vector
gyrates through angular changes which total just slightly less
than 250 deg. This amount could be reduced to less than 190 deg
if the vanes were located downstream from the meridional turn as
in Fig. 2(b). But if these changes in flow direction (i.e., change
in whirl and meridional flow direction) are considered as a single
required change rather than separately, a channel can be
designed as in Fig. 2(c) having a total turning of less than 160 deg.
Thus, in designing a crossover system in which changes are
required in both the moment of momentum of a fluid and the
meridional flow direction, the net required change can be
accomplished with a minimum of actual turning by the combined
action of vanes and shrouds of the annulus if the changes occur
simultaneously.

Fig. 2 Vane locations in annulus between stages of centrifugal compressor


2.2 Orientation of Vane Surfaces 


In order to turn a fluid, pressure gradients are established in it. 
These pressure gradients act across the passage, between the 
surfaces doing the turning, and create pressure differences on 
these opposing surfaces. For a given schedule of average velocity 
through the turn, the losses are related to the magnitude of the 
pressure differences existing along these surfaces. 

The magnitude of the pressure gradient is determined by the
sharpness of the turn and the pressure differences on opposing
walls of the channel are proportional to the mean pressure
gradient times the distance between the walls.¹ For a fixed pressure
gradient the pressure difference on opposing walls is reduced by
reducing the spacing between the walls. For example, consider
the elbows shown in Fig. 3. From equation (11) and Fig. 4 of
the reference given,¹ it can be shown that elbow 3(a) is superior to
3(b) because the spacing between opposing surfaces on which
pressure differences exist is much less for elbow 3(a) (although
the flow areas are the same).
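The proportionality between wall pressure difference and wall spacing can be illustrated numerically. The sketch below assumes the standard result that flow following a circular arc of radius R at speed c sets up a cross-stream pressure gradient of rho * c**2 / R. This relation, the function names, and the sample numbers are assumptions for illustration and are not taken from the paper or from the reference it cites.

```python
def cross_stream_gradient(rho, c, radius):
    """Mean pressure gradient normal to the flow in a turn,
    assuming the centripetal relation dp/dn = rho * c**2 / R."""
    return rho * c ** 2 / radius


def wall_pressure_difference(rho, c, radius, spacing):
    """Pressure difference between opposing walls: mean gradient
    times the distance between the walls."""
    return cross_stream_gradient(rho, c, radius) * spacing


if __name__ == "__main__":
    # Hypothetical air flow: 1.2 kg/m^3 at 50 m/s around a 0.3-m radius.
    # Both elbows have the same flow area (0.01 m^2) but different
    # spacing between the walls that do the turning.
    narrow = wall_pressure_difference(1.2, 50.0, 0.3, spacing=0.05)
    wide = wall_pressure_difference(1.2, 50.0, 0.3, spacing=0.20)
    print(f"0.05 m x 0.20 m elbow: {narrow:.0f} Pa across the turn")
    print(f"0.20 m x 0.05 m elbow: {wide:.0f} Pa across the turn")
```

For the same flow area and mean velocity, the elbow with the smaller spacing in the plane of turning carries the smaller wall pressure difference, which is the sense in which elbow 3(a) is preferred to 3(b).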

Since the tendency to separate depends upon the maximum
pressure difference it is expected that losses are influenced more
by the maximum spacing occurring in a passage than by
the average. Thus, although the average spacing between walls
(in the plane of turning) is less in 3(c) than 3(a), the maximum
spacing occurring along the diagonal of 3(c) is greater and this
elbow is therefore expected to be the less efficient. Not only is the
maximum spacing greater in 3(c) but it occurs at a section of the
elbow where secondary flows tend to cause the boundary layer to
build up the fastest as shown.

¹ For further information on the relation between loss and elbow
geometry see: Airplane Heating and Ventilating Equipment, SAE
Aeronautical Information Series, Report No. 2.

Fig. 3 Several examples of elbows with varying aspect ratios
(figure label: secondary flow in boundary layer)

Next, consider Example 3(d) in which the inclination of two 
sides (a and c) with respect to the plane of turning is the same as 
3(c), but the other two sides are aligned perpendicular to this 
plane so that the maximum spacing may be no greater than 
Example 3(a) with only a small increase in the ratio of wetted
perimeter to flow area of a cross section. Thus the efficiency of 
3(d) should be better than 3(c) although perhaps somewhat less 
than 3(a). 

From this discussion of simple elbows the following conclusion 
is drawn: The losses in a turn are influenced by the orientation of 
the cross section of the passage with respect to the plane of turn- 
ing and are usually minimized by aligning two sides of the passage 
perpendicular to the plane of the turn. It is suggested that this 
conclusion might be stated more generally as follows: If a pres- 
sure gradient exists in a fluid flowing in a channel, losses in gen- 
eral can be minimized if two surfaces of the channel are aligned 
perpendicular to the gradient, provided that the ratio of wetted 
surface to flow volume does not become too large. 

Now return to the cascade of vanes in an annulus. In annular 
cascades two sets of surfaces occur: (1) The vane surfaces and (2) 
the inner and outer shroud surfaces. Together they form a flow
channel of more or less rectangular cross section. Although the 
surfaces which form the inner and outer shroud of the annulus are 
not necessarily restricted to surfaces of revolution, they will in 
general be selected as such for manufacturing reasons. Thus the 
vane surfaces are selected as the two sides of the rectangular cross 
section which are to be oriented in a manner to obtain the mini- 
mum losses. That is, the blade surface is aligned perpendicular to 
the pressure gradient in the cross section of the passage, so that 
no variation in pressure occurs along the blade span. 


3.0 Design Procedure 


3.1 Definitions and Assumptions 


3.1.1 Co-Ordinates. In the following discussion use the 
cylindrical co-ordinates, r, θ, and z shown in Fig. 4(a).

In addition to the cylindrical co-ordinates, use is made of a set 
of orthogonal co-ordinates on the meridional plane, consisting of 
meridional lines uniformly spaced between shrouds and their 
normals. The distance measured along the meridional line is de- 
noted by the symbol s. Similarly, the distance measured along 
the normals is designated by the letter n. 
Fig. 4(a) Surface of revolution generated by meridional line with
co-ordinates and velocity components


3.1.2 Angular Relationships. In order to discuss the flow and 
define the blade shape certain angles must be defined. Two of 
these angles are essentially related to the flow direction. If the 
flow direction is assumed to be parallel to the blade surface, these 
two angles plus one more also define the blade surface locally. 

The flow direction at a given point is described by two angles,
α and β. The angle α is measured between a line tangent to the
projection of the flow direction on the meridional plane (plane
containing the axis of the blade row) and the axis of the blade
row, as shown in Fig. 4(a). The angle β is measured on the
surface of revolution generated by the streamline projection on
the meridional plane as shown in Fig. 4(a). It is assumed that
the meridional streamlines can be approximated by the uniformly
spaced meridional lines between hub and shroud. (Note that a
general surface is shown in Fig. 4 and is not necessarily one
associated with the crossover vanes.) The components of velocity
in the direction of the co-ordinates previously defined are related
through these angles by

    c_m = c \cos\beta, \qquad c_u = c \sin\beta

    c_r = c_m \sin\alpha, \qquad c_z = c_m \cos\alpha        (1)
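The relations of Equations (1) can be illustrated with a minimal Python sketch. The function name `velocity_components` is hypothetical, and the choice of α = 90 deg for radially outward flow at station 0-0 is an assumption made here for illustration; the 75 deg flow angle and the 50 ft/sec meridional velocity are the S100 inlet values quoted later in the paper.

```python
import math

def velocity_components(c, alpha_deg, beta_deg):
    """Resolve the speed c into (c_m, c_u, c_r, c_z) per Equations (1)."""
    alpha = math.radians(alpha_deg)
    beta = math.radians(beta_deg)
    c_m = c * math.cos(beta)     # meridional component
    c_u = c * math.sin(beta)     # tangential (whirl) component
    c_r = c_m * math.sin(alpha)  # radial component
    c_z = c_m * math.cos(alpha)  # axial component
    return c_m, c_u, c_r, c_z

# Assumed inlet state: radially outward flow (alpha = 90 deg, an assumption),
# beta = 75 deg, with c chosen so that c_m = 50 ft/sec.
c = 50.0 / math.cos(math.radians(75.0))
c_m, c_u, c_r, c_z = velocity_components(c, 90.0, 75.0)
print(f"c_m = {c_m:.2f}, c_u = {c_u:.2f}, c_r = {c_r:.2f}, c_z = {c_z:.2f} ft/sec")
```

With these inputs the whirl component comes out close to the 186.61 ft/sec used in the design example, and the meridional velocity is entirely radial.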


It is assumed that an adequate description of the blade surface 
in the immediate neighborhood of a point is given by determining 
two nonparallel lines which are tangent to the surface at the 
point. As will be indicated in the following section, the flow is 
assumed to be tangent to the mean blade surface. Under this 
assumption, a line tangent to the flow direction and defined by α
and β is also tangent to the mean blade surface. The second
tangent line is selected to be that which is tangent to the
intersection between the blade and the surface of revolution
generated by rotating a normal to the meridional flow direction
about the axis of the blade row. This tangent line is located by
the angles α and ζ, where ζ is the angle between the tangent line
and the meridional plane, shown in Fig. 4(b).

3.1.3 Assumptions. To avoid the prohibitive amount of effort 
required to consider all the three-dimensional aspects of flow, 
certain simplifying assumptions are necessary. The justification 
for the assumptions depends only upon whether or not blades, 
designed under the assumptions, demonstrate the desired charac- 
teristics. 
Fig. 4(b) Surface of revolution generated by normal line with blade
intersection shown (figure labels: intersection between surface of
revolution and blade surface; intersection between surface of
revolution and meridional plane; surface of revolution obtained by
rotating normal to meridional flow direction about axis of blade row)


Assume that, in the design calculations, flow parameters and 
their derivatives over any normal cross section of the blade pas- 
sage can be adequately represented by an average value and that 
these average values occur on a line representing the intersection 
between a mean blade surface and the mean surface of revolution 
between the hub and shroud. It is further assumed that this line 
is parallel to the average flow direction. A part of the design 
problem is the determination of the geometry of this line. 

The fluid is assumed to be inviscid. 


3.2 Design Procedure 


The procedure for designing vanes for an annulus, such as 
represented by the schematic diagram of Fig. 5, is as follows: 
The vanes are located to remove whirl at the same time the 
meridional flow direction is changing from radially outward to 
radially inward, i.e., from station 0-0 to station 10-10 in Fig. 5. 
As the vanes extend around the meridional turn, their thickness 
is increased as partial compensation for the diffusion resulting 
from the removal of whirl and a thick blade section results at 
station 10-10 where the turn is complete. To avoid an abrupt 
change, a rather long tail is added, along which the thickness de- 
creases gradually. 

3.2.1 Mean Blade Element. Future experimentation may de- 
termine the optimum manner in which moment of momentum, or 
whirl, should be removed from a fluid leaving the vaneless dif- 
fuser. At present, however, experience offers no conclusive basis 
for prescribing the optimum manner of removing whirl and so it is 
suggested arbitrarily that the whirl be changed at a constant rate 
between stations 0-0 and 10-10. In other words let 


    \lambda = a s + b


where λ is the average whirl, or moment of momentum, at a given
cross section of the passage. From the impeller characteristics
and estimated losses of the system, the inlet conditions to the
vane are presumed to be known. If the length of the mean
meridional streamline between stations 0-0 and 10-10 is
designated by s_0, and s is taken to be zero at station 0-0, then

    \lambda = \lambda_{0-0} (1 - s/s_0)        (2)

assuming all the whirl is to be removed.

Fig. 5 S100 model, flow passage schematic

It would be desirable if the mean velocity of the flow could be 
held constant or even accelerated in the region of turning, i.e., 
station 0-0 to 10-10. Usually, however, there is not sufficient 
space downstream of the turn (station 10-10) to allow efficient 
diffusion to take place and so it is probably better to allow 
some diffusion to occur in the turning section. Again an accurate 
determination of the optimum velocity schedule awaits future 
experimental investigation; so it is an arbitrary suggestion that 
about 20 to 30 per cent of the total diffusion in the crossover 
blades be taken in the turning region. 

As already noted, the inlet conditions to the blades are assumed
to be known and from the passage geometry at the trailing edge of
the blade tail, the meridional velocity c_m is known. Assuming
all the whirl is to be removed completely, this is the mean
discharge velocity.

If the diffusion between station 0-0 and 10-10 is taken to be
linear

    c = c_{0-0} - \frac{x}{100} (\Delta c) \frac{s}{s_0}        (3)

where (Δc) is the total change in mean velocity occurring between
the inlet and the discharge of the vanes and x is the per cent of the
diffusion to be taken in the turning sections.

From Equations (1)

    \frac{\lambda}{r c} = \sin\beta

The mean flow direction and, therefore, by our approximation, a
tangent line to the mean blade surface is given by the prescribed
mean surface of revolution (between shrouds) and the angle β
given by

    \sin\beta = \frac{\lambda_{0-0} (1 - s/s_0)}
                     {r \left[ c_{0-0} - \frac{x}{100} (\Delta c) \frac{s}{s_0} \right]}        (4)

Fig. 6 Components of force and accelerations acting on fluid particle

The θ co-ordinate of the blade along this mean line is obtained
by integrating

    \theta = \int_0^s \frac{\tan\beta}{r} \, ds        (5)


3.2.2 Blade Orientation. The next step in the design of the 
blade is to determine the inclination of the blade at several points 
on the mean line such that the second principle mentioned in an 
earlier section is satisfied, namely, that the blades will be oriented 
normal to the average pressure gradient in a plane normal to the 
flow. The following development determines the values of ζ
which satisfy this condition. 

The meridional components of pressure forces acting on a fluid
particle in a direction normal to a meridional streamline are shown
in Fig. 6. Since the normal component of velocity is always zero,
the blade and pressure forces equal the accelerations shown

    p \, r \, d\theta \, ds - \left( p + \frac{\partial p}{\partial n} dn \right) r \, d\theta \, ds
        = - \frac{c_u^2}{r} \cos\alpha \; \rho \, dn \, ds \, r \, d\theta
          - \frac{c_m^2}{r_c} \; \rho \, dn \, ds \, r \, d\theta

which becomes when simplified

    - \frac{1}{\rho} \frac{\partial p}{\partial n}
        = - \frac{c_u^2}{r} \cos\alpha - \frac{c_m^2}{r_c}        (6)

Note that for the case shown (dα/ds is negative), c_m^2/r_c has a
negative sign in front. If vanes were being designed for a case
where dα/ds was positive the sign in front of this term would also
be positive.

Similarly the tangential component of pressure forces acting on 
a fluid particle produces a change in the moment of momentum of 
the particle so that 


Ov 7) 
r 

The variation in the tangential direction of moment of momentum
is usually small compared to the variation in the meridional
direction, however, so the last term of Equation (7) will be
assumed negligible.

Now the gradient of the pressure values on a surface normal to 
the flow is a vector F with components 
Combining Equations (6), (7), and (8) gives


c,? cos a + — c,,? 


Now ζ and ζ' have been defined so that the blade is normal
to the average pressure gradient if 


Thus the condition that the blade span be aligned normal to the
average pressure gradient in a normal cross section of the passage
is satisfied if (note that c_u = λ/r)




where the values for Equation (9) are taken to be average values 
along the mean line of the passage. 

The mean blade shape is now formed by moving a straight line 
along the mean meridional line between hub and shroud while it 
is oriented in a manner such that its projection on the meridional 
plane is always normal to the mean hub-shroud line and its
inclination from the meridional plane is given by the angle ζ.
Then along the normals on the meridional plane the blade surface
is given by

    (\tan\beta \sin\alpha + \tan\zeta \cos\alpha)_M \, r \cos(\theta - \theta_M)
        - r \sin(\theta - \theta_M)
        + (\tan\beta \cos\alpha - \tan\zeta \sin\alpha)_M \, (z - z_M)
        = r_M (\tan\beta \sin\alpha + \tan\zeta \cos\alpha)_M        (10)



where the subscript M indicates the calculated values on the mean 
hub-shroud line on the meridional plane. 
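
Equation (10) is transcendental in θ but can be solved in closed form. The following is a minimal Python sketch, not the author's procedure: the function names are hypothetical, and the values of α_M, z_M, and the hub point (r, z) are assumed purely for illustration. The remaining inputs are the station 1-1 mean-line values of the S100 design.

```python
import math

def plane_coefficients(tan_beta, tan_zeta, alpha):
    """Return the two bracketed coefficients of Equation (10) at point M."""
    a = tan_beta * math.sin(alpha) + tan_zeta * math.cos(alpha)
    b = tan_beta * math.cos(alpha) - tan_zeta * math.sin(alpha)
    return a, b

def residual(theta, r, z, a, b, r_m, theta_m, z_m):
    """Left side minus right side of Equation (10); zero on the blade."""
    phi = theta - theta_m
    return (a * r * math.cos(phi) - r * math.sin(phi)
            + b * (z - z_m) - r_m * a)

def blade_theta(r, z, a, b, r_m, theta_m, z_m):
    """Solve Equation (10) for theta at a given (r, z).

    Uses a*cos(phi) - sin(phi) = sqrt(1 + a^2) * cos(phi + psi) with
    psi = atan2(1, a), keeping the root that gives phi = 0 at M.
    """
    k = (r_m * a - b * (z - z_m)) / r
    norm = math.hypot(a, 1.0)
    if abs(k) > norm:
        raise ValueError("point (r, z) is not reached by the blade line")
    psi = math.atan2(1.0, a)
    return theta_m + math.acos(k / norm) - psi

# Station 1-1 mean-line values; alpha_m, z_m and the hub point are assumed.
alpha_m = math.radians(72.0)            # hypothetical meridional angle
a, b = plane_coefficients(2.710, -1.285, alpha_m)
r_m, theta_m, z_m = 1.549, 0.1020, 0.0  # z_m is a hypothetical datum
theta_hub = blade_theta(1.52, 0.01, a, b, r_m, theta_m, z_m)
print(f"theta at assumed hub point = {theta_hub:.4f} rad")
```

Substituting the returned θ back into `residual` gives zero to rounding error, which is a convenient check when tabulating hub and shroud values station by station.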

To determine the mean blade shape for the crossover vanes, 
solve Equation (10) for θ at the intersections of the hub and
shroud with the normals at stations 0-0 through 10-10 as shown.
Plot values of θ against the distance s measured along hub and
shroud as shown in Fig. 7. The computed values in general do not 
necessarily produce an inlet condition with proper alignment for 
shockless entry in the region of the hub and shroud and some slight 
modifications may be necessary to assure that this condition is 
satisfied. 

3.2.3 Blade Thickness. The mean blade surface is now complete,
values of r, θ, and z having been determined at three points
along every normal. It remains to determine the thickness of the
blade at these points. First, a blockage factor is defined by

    \delta = \frac{\theta_s - \theta_p}{\Delta\theta}

where θ_s is the angular displacement to the suction surface of the
channel, θ_p is the angular displacement to the pressure surface,
and Δθ is the angular displacement between mean blade surfaces.
The blade thickness is related to δ by

    t = (1 - \delta) \, r \, (\Delta\theta)        (11)
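
As a hedged illustration of Equation (11), the sketch below evaluates the blade thickness for equally spaced vanes, for which Δθ = 2π/N. The vane count N is not stated in the paper, so `n_vanes = 20` is a hypothetical value, and the δ and r inputs are illustrative rather than taken from the design.

```python
import math

def blade_thickness(delta, r, n_vanes):
    """Blade thickness from Equation (11), t = (1 - delta) * r * d_theta.

    Assumes equally spaced vanes, so d_theta = 2*pi / n_vanes.
    """
    d_theta = 2.0 * math.pi / n_vanes
    return (1.0 - delta) * r * d_theta

n_vanes = 20                                  # hypothetical vane count
t_open = blade_thickness(1.0, 1.5, n_vanes)   # delta = 1: no blockage
t_half = blade_thickness(0.5, 1.5, n_vanes)   # delta = 0.5: half blocked
print(f"t(delta=1.0) = {t_open:.4f}, t(delta=0.5) = {t_half:.4f}")
```

A blockage factor of unity gives zero thickness, and the thickness grows linearly as δ falls, in the same length units as r.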


The annular cross-sectional flow area between hub and shroud
less the area blocked off by the blade thickness is

    A = 2\pi r h \delta

    \frac{A}{A_{0-0}} = \frac{2\pi r h \delta}{2\pi (r h)_{0-0}} = R H \delta

where h = measured distance between hub and shroud along line
normal to mean flow path


Fig. 8 Schematic sketch of crossover model test equipment
(figure labels: inlet screen, fan and motor section, bypass throttle,
transition ducts, flow straightener section, crossover model)
    H = h/h_{0-0}
    R = r/r_{0-0}

Now

    \rho c_m A = (\rho c_m A)_{0-0}

The velocities have been prescribed along the mean line so that
the density ratios can be determined by

where a_0 is the stagnation speed of sound.

Fig. 9 Crossover model test equipment assembly

Fig. 10 Discharge end of model showing return channel vanes (S101
model) and instrumentation

Fig. 11 Prewhirl vanes actuating mechanism and indicator

Fig. 12 Interior of model showing arrangement of prewhirl blades and
inlet

δ is assumed to be constant along the normals to the meridional
streamlines and is computed on the mean line between hub and
shroud; the thickness at other points on the normal is then
computed from Equation (11).


4.0 Low Speed Tests 


A low speed crossover model, designated S100, was designed by
the procedures of this report and tested. For comparison, another
model, designated S101, which simulated a more conventional
design, was also tested.


4.1 Design of the Model 


4.1.1 Design Considerations. While the ideal conditions for
testing any piece of equipment would be under actual operating
conditions, the factors of sealing, cost, flexibility, and time
confined any detailed preliminary research to a test setup using
low speed air at ambient conditions and to the use of wood or other
easily fabricated materials for the flow passages. This at once
poses the question of dynamic similarity. It is reasonable to
assume, however, that on a consistent basis, the model
configuration yielding the better performance in air will be better
under actual operating conditions.

Fig. 13 Model partially assembled with diaphragm and crossover vanes
of S100 model

Fig. 14 Diffuser side of diaphragm showing arrangement of S100
crossover vanes

162 / MARCH 1960

4.1.2 Design of the Test Equipment 

4.1.2.1 The Test Rig. Rather than provide a complete first- 
stage model which would involve design of an impeller and asso- 
ciated drive equipment, it was decided to use a fan for the air 
supply and simulate the gas whirl in the diffuser by a set of pre- 
whirl vanes placed approximately at the same relative position 
the impeller tip would have occupied. 

Shown schematically in Fig. 8, the model test equipment can 
be seen in the photographs of Figs. 9 and 10. Figs. 11, 12, 13, and 
14 show various parts and stages of assembly of the model itself. 

Referring to Fig. 8, the air enters the 20-hp Joy Axivane Fan and 
then passes to the bypass throttle section. This consists of an 
axially slotted cylindrical duct with an outer sliding band also 
slotted in the same manner. This band may be moved to un- 
cover or completely close the slots in the cylindrical duct and 
allows the fan to be matched to the model at its peak pressure 
ratio point. Passing on through a transition duct, the air then 
passes through a flow straightener section with 8 mesh screens at 
Table 1

Station  s_M     r_M    λ_M    c_mM  tan β_M  θ_M     tan ζ_M  δ       s_H     θ_H     s_S     θ_S
0-0      0       1.500  279.9   50   3.732    0       -0.841   1.0     0       0.0500          -0.0530
1-1      0.0498  1.549  251.9   60   2.710    0.1020  -1.285           0.0296  0.1375  0.0698   0.0515
2-2      0.0998  1.593  223.9   70   2.007    0.1760  -1.547   0.7205  0.0596  0.2350  0.1408   0.1195
3-3      0.1496  1.628  195.9   80   1.504    0.2310  -1.722           0.0902  0.2980  0.2098   0.1700
4-4      0.1994  1.650  167.9   90   1.130    0.2725  -1.862   0.5824  0.1215  0.3445  0.2782   0.2112
5-5      0.2500  1.658  139.9  100   0.843    0.3020  -1.988           0.1540  0.3785  0.3490   0.2450
6-6      0.2991  1.650  111.9  110   0.616    0.3235  -2.108   0.5162  0.1874  0.3990  0.4113   0.2665
7-7      0.3489  1.628   83.9  120   0.429    0.3395  -2.240           0.2218  0.4100  0.4764   0.2820
8-8      0.3988  1.593   55.9  130   0.270    0.3500  -1.946   0.4938  0.2572  0.4110  0.5408   0.3000
9-9      0.4486  1.549   27.9  140   0.133    0.3560  -1.498           0.2931  0.4020  0.6040   0.3120
10-10    0.4984  1.500    0    150   0        0.3585  -1.000   0.5000  0.3294  0.3910  0.6676   0.3205
Fig. 15 S101 model, flow passage schematic


each end. Between them the duct is packed with thin wall card- 
board tubes, 1’/, in. diameter. A second transition section brings 
the air to the inlet of the model. 

The model inlet is shaped similar to the hub and shroud contour
of an impeller but the shroud contour is such as to give a uniform
acceleration along the mean streamline. The flow is radial at the
end of the inlet section where it then enters the whirl inducer, or
prewhirl blades (see Figs. 11 and 12), which give the air a
tangential component to simulate the flow from an impeller. There
are 36 prewhirl blades, all simultaneously adjustable.

The prewhirl blades are followed by the parallel wall diffuser
section of the model proper which extends radially to the
crossover section. In the S100 model, the crossover vanes were
located at the tip of the diffuser, while in the S101 model the
crossover was vaneless and only return channel vanes were
installed. The flow passages of both models are shown
schematically in Figs. 5 and 15.

The outer diffuser and return channel walls are of clear Plexi- 
glas for observation of the flow. The diaphragm assembly, which 
forms the inner walls of the diffuser and return channel, carries 
the crossover or the return channel vanes. The entire diaphragm 
assembly can be rotated in the case. 

4.1.2.2 Design of Crossover Vanes (S100 Model). In ac- 
cordance with the principle presented earlier, the turning portion 
of the vanes was confined to the region of the annulus commonly 
referred to as the crossover elbow. As seen in Fig. 5, the inner
and outer walls converge slightly to partially compensate for the 
diffusion resulting from the removal of whirl by the vanes. In 
addition to this convergence of the shroud walls, the thickness 
of the blades was gradually increased which results in a thick 
blade section at the completion of the turn. In order to avoid an 
abrupt change in vane thickness, a straight tapered section was 
added to the vane following the region of the turn along which the 
thickness was gradually reduced. 


The flow area normal to the turn was adjusted to allow some
deceleration in the vanes in the region of the turning. After the
turn, the rate of diffusion is higher than desired but necessary in
order to meet the inlet conditions of the simulated second stage.


From the known operating characteristics of the fan and
estimated losses in the system, the meridional velocity at the inlet
to the vanes (station 0-0, Fig. 5) was calculated to be 50 ft/sec
when the prewhirl vanes were set to give a flow angle of 75 deg at
the vane inlet. Thus at station 0-0

    c_u = c_m \tan\beta

    c_{u,0-0} = 50 \times \tan 75 deg
              = 186.61 ft/sec

    \lambda = r c_u

    \lambda_{0-0} = 1.5 \times 186.61
                  = 279.91 ft²/sec

The whirl is to be zero at station 10-10 with a linear variation
in λ_M between station 0-0 and 10-10. The values assumed for
λ_M on the mean line are given in the fourth column of Table 1.
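
The inlet whirl and its linear schedule can be reproduced with a short Python sketch. The variable names are hypothetical, and the sketch assumes the eleven stations are equally spaced along the mean line, which the tabulated s values satisfy only approximately.

```python
import math

c_m_inlet = 50.0        # ft/sec, meridional velocity at station 0-0
beta_inlet_deg = 75.0   # deg, flow angle at the vane inlet
r_inlet = 1.5           # mean radius at station 0-0

# Whirl velocity and moment of momentum at the inlet.
c_u_inlet = c_m_inlet * math.tan(math.radians(beta_inlet_deg))
lam_inlet = r_inlet * c_u_inlet

def whirl(s_fraction):
    """Linear whirl schedule of Equation (2); s_fraction runs from 0 to 1."""
    return lam_inlet * (1.0 - s_fraction)

# Assumption: stations 0-0 to 10-10 at equal fractions of the path length.
schedule = [whirl(k / 10) for k in range(11)]
for k, lam in enumerate(schedule):
    print(f"station {k}-{k}: lambda = {lam:6.1f}")
```

The computed values agree with the quoted 186.61 ft/sec and 279.91 to within rounding of the tangent, and the schedule falls by equal steps to zero at station 10-10.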

Approximately 20 per cent diffusion was allowed in the turning
section making the discharge velocity equal to 150 ft/sec. This
is equal to c_m at station 10-10, and a linear variation of c_m
between the inlet value of 50 ft/sec and this value was assumed,
which is shown in the fifth column of Table 1. From λ_M and r_M
the value of c_uM is determined, from which

    \tan\beta_M = c_{uM} / c_{mM}

values for which are shown in column 6. The angular displacements
of points on the mean line follow in column 7, which were
determined by Equation (5).
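
Columns 6 and 7 can be checked numerically from the mean-line values of s, r, λ, and c_m in Table 1. The sketch below is an illustration only: the paper does not say how Equation (5) was integrated, so the trapezoidal rule used here is an assumption, and small differences from the tabulated θ values are to be expected.

```python
# Mean-line values from Table 1, stations 0-0 through 10-10.
s = [0.0, 0.0498, 0.0998, 0.1496, 0.1994, 0.2500,
     0.2991, 0.3489, 0.3988, 0.4486, 0.4984]
r = [1.500, 1.549, 1.593, 1.628, 1.650, 1.658,
     1.650, 1.628, 1.593, 1.549, 1.500]
lam = [279.9, 251.9, 223.9, 195.9, 167.9, 139.9,
       111.9, 83.9, 55.9, 27.9, 0.0]
c_m = [50.0 + 10.0 * k for k in range(11)]

# Column 6: tan(beta) = c_u / c_m with c_u = lambda / r.
tan_beta = [l / (ri * ci) for l, ri, ci in zip(lam, r, c_m)]

# Column 7: theta = integral of tan(beta)/r ds, by the trapezoidal rule
# (an assumed quadrature; the original method is not stated).
integrand = [tb / ri for tb, ri in zip(tan_beta, r)]
theta = [0.0]
for k in range(1, len(s)):
    step = 0.5 * (integrand[k - 1] + integrand[k]) * (s[k] - s[k - 1])
    theta.append(theta[-1] + step)

for k in range(len(s)):
    print(f"{k}-{k}: tan beta = {tan_beta[k]:.3f}, theta = {theta[k]:.4f}")
```

The tan β values match column 6 closely, and the integrated θ at station 10-10 comes out near the tabulated 0.3585, slightly higher because the trapezoidal rule overestimates a convex integrand.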

From the values already determined, tan ζ_M was calculated
using Equation (9); these are the values shown in column 8.

With α_M (from Fig. 5), β_M, ζ_M, r_M, and z_M known, Equation (10)
yields the angular displacement of points of the mean blade line
on the inner and outer shroud. These are listed in columns 11 and
13 of Table 1 as well as plotted in Fig. 7.
Fig. 16 Profile of return channel vanes of S101 model


There remains only to specify the blade thickness, which is
done in terms of δ from Equation (12).

4.1.2.3 Design of Conventional Return Channel Vanes (S101 
Model). The return channel vanes for this model were scaled 
directly from an existing compressor as nearly as possible from a 
geometric standpoint. A strict adherence was not possible be- 
cause the ratio of the radius at the leading edge to the radius at 
the trailing edge was not the same. Alternately long and short 
blades were used; the blades are of constant thickness and the 
camber line is constructed as a series of tangent circular arcs. 


Blade specifications 
Radius to leading edge ay 
Radius to trailing edge, long blade ..... 11.00 in.
Radius to trailing edge, short blade .... 14.32 in.
Blade thickness ......................... 9/32 in.
Number of short blades .................. 16
Number of long blades ................... 16


The profile of the blades and their actual location relative to 
each other is shown in Fig. 16. 


4.2 Instrumentation 


4.2.1 Flow Measurement. Program objectives necessitated 
traverses to be made in the diffuser and return channel and, where 
possible, use of rake probes for flow profiles. In addition, static 
pressure taps were provided. The instrumentation is shown 
schematically in Fig. 17 and actual instrumentation is seen in 
Figs. 9 and 10. 

The instrumentation of the S100 and S101 models was identical
except for the locations of the diffuser traverse stations, the
station for the S101 being at a somewhat smaller radius than that
for the S100.

The reader will be spared a detailed account of each piece of 
instrumentation but the information is on file at Carrier and the 
author will be happy to supply it on request. 

In order to provide a peripheral traverse between blades, the 
entire diaphragm assembly which carries the blades was made a 
sliding fit inside the model casing. This allowed the diaphragm 
to rotate and provide the necessary traverse behind the blades. 

4.2.2 Visual Observation of Flow. To allow observation of the 
flow with tufts or streamers, the outer walls of the diffuser and re- 
turn channel passages were made of transparent Plexiglas, as was 
a section of the crossover wall which covered an arc of 30 deg.
Thus, streamers attached to the walls and vane passages were 
readily visible and the flow pattern could be observed directly. 
Fig. 17 Radial cross section of S100 model showing schematic
instrumentation locations

A  Traverse station, front cover
B  Wall statics, front cover, 3 sets of 4 holes each spaced 120 deg apart
C  Wall statics, rear cover, 3 sets of 6 holes each spaced 120 deg apart
D  Traverse station, rear cover, one only
E  Rake probe holes, front cover, 3 sets 120 deg apart
F  Rake probe holes, rear cover, 3 sets 120 deg apart


4.3 Data Reduction and Calculations 


The loss coefficient of the model may be expressed as: 


where the subscripts 1 and 2 refer to upstream and downstream 
conditions, respectively. 

In evaluating p,, the mass-weighted integrated mean was used. 
Since the air may be considered incompressible at these low 
velocities, this reduces to the volume weighted integrated mean 
and may be expressed as 
Fig. 18 Diffuser flow angle and normalized velocity distribution for the
three peripheral stations

Fig. 19 Diffuser flow angle and normalized velocity distribution for the
three peripheral stations

Likewise,


4.4 Test Results

While the peripheral distribution of the total velocity head in
the vaneless diffuser section was reasonably uniform, the
meridional component of velocity c_m was badly distorted. This, of
course, produced an undesirable variation in the flow angle
approaching the vanes. All efforts to correct this condition (which
has not been found in vaneless diffusers of rotating impellers)
were unsuccessful. The only recourse, therefore, was to take
care that upstream and downstream measurements were taken
on the same streamline of flow. To determine the proper location
of the downstream stations relative to the upstream ones, smoke
was used to trace out a streamline path. Since the flow was
steady with time, it is felt that this procedure leads to significant
results.

Fig. 20 Diffuser flow angle and normalized velocity distribution for the
three peripheral stations

Fig. 21 Diffuser flow angle and normalized velocity distribution for the
three peripheral stations

A considerable variation in c_m, and therefore flow angle, also
occurred from wall to wall of the vaneless diffuser as shown in
Figs. 18 and 19. These surveys were taken at the radial location
indicated by A in Fig. 17. It is seen that for the 20-deg setting of
the prewhirl vanes the actual flow angle varies approximately
±15 deg from the design angle of 75 deg for the vane inlet and the
mass averaged angle turns out to be about six degrees less than
the design angle. At a prewhirl setting of 15 deg the mass
averaged flow angle is approximately 10 deg below the design angle.

Fig. 22 Faired normalized discharge velocity distributions for the three
peripheral stations

Fig. 24 Faired normalized discharge velocity distributions for the three
peripheral stations

Similar distributions in the vaneless diffuser are shown for the
S101 model in Figs. 20 and 21. The flow conditions at this
upstream control station appear to be at least as favorable for the
S101 vanes as for the S100 vanes.

The calculated loss coefficients for the two models are shown in 
Table 2. In both cases the optimum condition obtainable oc- 
curred with the prewhirl indicator set at 20 deg. (Note that a 
prewhirl setting beyond 20 deg could not be obtained.) At this 
point the losses of the proposed crossover vane design are only 
51 per cent as large as those in the representative conventional 
design. This corresponds to an improvement in over-all efficiency 
of about 4 points for a two-stage compressor having a pressure 
ratio of 3.0 and an efficiency in the neighborhood of 75 per cent.
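
The 51 per cent figure follows directly from the loss coefficients of Table 2 at the 20 deg prewhirl setting. A minimal Python check, using only the tabulated values (the dictionary layout is an illustrative choice):

```python
# Loss coefficients from Table 2, keyed by model and prewhirl setting (deg).
loss = {
    "S100":  {20: 0.32, 15: 0.65},
    "S100A": {20: 0.34, 15: 0.70},
    "S101":  {20: 0.63, 15: 0.73},
}

# Ratio of proposed-design losses to conventional-design losses.
ratio_20 = loss["S100"][20] / loss["S101"][20]
ratio_15 = loss["S100"][15] / loss["S101"][15]
print(f"20 deg: {100 * ratio_20:.0f} per cent, 15 deg: {100 * ratio_15:.0f} per cent")
```

The advantage of the S100 vanes is much smaller at the 15 deg setting, which is consistent with the sensitivity to angle of attack discussed in the text.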

It is noted that the S100 model shows a much greater sensitivity
to angle of attack. Much of this can be attributed to an
unfortunate choice of very sharp leading edges for these vanes as
compared to very blunt and rounded leading edges for the S101
vanes.

Fig. 23 Faired normalized discharge velocity distributions for the three
peripheral stations

Fig. 25 Faired normalized discharge velocity distributions for the three
peripheral stations


Table 2 Loss coefficients

           Prewhirl setting angle
Model      20 deg    15 deg
S100       0.32      0.65
S100(a)    0.20
S100A      0.34      0.70
S101       0.63      0.73

(a) To end of vanes.

Another factor must be considered here, however. As the flow
rate changes, the angle of attack on the vanes changes. The
author has observed cases where the flow angle at a given radius
in a vaneless diffuser varied widely with flow rate, not so much
because of the change in flow volume as due to a change in c_m
profile which accompanied the flow change.

On the other hand, the velocity profile after the turn, while dis- 
torted, may be well stabilized by the turn so that the only effect 
on flow angle is just that due to the volume change. 

Thus, vanes with leading edges located ahead of the crossover 
turn may have to operate through a wider range of angles of at- 
tack than those downstream from the turn. 

In addition to a low loss coefficient, a good set of vanes should 
produce a flow into the second stage with a minimum of distortion. 
Exploratory traverses with streamers showed the discharge from 
the S100 model to be axial across the annulus. The S101 model, 
however, had not removed all the whirl from the air; a constant 
whirl of about 30 deg with rotation existed over the entire an- 
nulus. 

Measurements obtained from rakes located at F in Fig. 17 were
used to calculate the velocities shown in Figs. 22 and 23 for the
S100 model and in Figs. 24 and 25 for the S101 model. The S101
model is seen to give a somewhat superior profile, the S100 model
having a small region of separation at the outside radius. The
superiority of S101 over the S100 in this respect was attributed to
the fact that a larger deceleration in the vanes allowed a
substantial re-acceleration of the flow in the turning annulus
before the simulated second stage inlet.

To test this conjecture, the S100 model was modified by making
a sudden expansion just downstream from the vane trailing edge
and then re-accelerating in a manner similar to the S101 model.
This model, designated S100A, had a loss coefficient only two
points higher at the best point and resulted in a discharge profile
which compared very favorably with S101, shown in Fig. 26.


4.5 Conclusions 


For the tests described, the S100 vanes demonstrated a
superiority over the S101 vanes. The loss coefficient of the S100
vanes was lower by approximately 1/2, the vanes removed all the
whirl and, with modification, gave a profile at the second stage
inlet that compared favorably with the S101 vanes. It is
believed that the better performance resulted from:

1 Less total turning of the flow. 

2 Possibly better inlet conditions just ahead of the 180 deg 
crossover turn than after it. 


3 A longer path length over which the turning and diffusion 
could be co-ordinated. 
4 Orientation of vane surfaces throughout the turning process
to give favorable passage aspect ratios. 


The optimistic results of these tests, however, must not be 
taken as an absolute assurance that the performance of all com- 
pressors will be improved through the use of the proposed cross- 
over vanes. As already mentioned, the profile of the meridional 
component of velocity, at a given vaneless diffuser radius, may 
vary from full channel flow to one having a back flow on one wall 
or the other as the compressor flow rate is changed. The resulting 
wide variations in flow angle could well make the efficiency of the 
vanes very sensitive to changes in flow rate, leading to a narrow 
compressor characteristic. 

Furthermore, the flow rate at which the profile of the meridional 
velocity component is most flat and symmetrical at the crossover 
vane inlet (corresponding to the optimum point for the cross- 
over vanes) will not, in general, correspond to the optimum point 
for the impeller. The anticipated improvement in peak efficiency 
may not be realized if this occurs. 

Thus it may be necessary to study the vaneless diffuser and 
discover ways of optimizing and, most importantly, stabilizing the 
flow in the vaneless diffuser before the advantages of the pro- 
posed crossover design can be realized. 


Acknowledgments 


The author is indebted to Mr. James A. Papapanu, Senior En- 
gineer, Carrier Corporation, under whose direction the experi- 
mental phase of this work was carried out. 


DISCUSSION 
R. C. Dean, Jr.,² and W. A. McGahan³


The author's considerations in the design of turning diffusing
channels for the crossover systems of centrifugal compressors 
certainly warrant serious consideration. It is unfortunate that 
this worthy investigation is marred on its experimental side by a 
problem that is only now becoming appreciated. The trouble 
arises from the use of stationary swirl vanes to produce a simu- 
lated impeller discharge flow in a vaneless diffuser. While this 
is a common technique, detailed investigations in the M.I.T. Gas
Turbine Laboratory, particularly that of Isom [1],⁴ have revealed
that mixing shear between the swirl vane wakes and the
through-flow between blades is not sufficient in general to accel- 
erate the wakes against the adverse radial pressure gradient in 
the diffuser. Isom, who was attempting to produce an axisym- 
metric flow in a vaneless diffuser for skewed boundary-layer 
studies, discovered to his dismay that he had instead a diffuser 
discharge flow at a discharge radius ratio of about 2.0 containing 
uniformly spaced regions of extreme total pressure deficiency cor- 
responding exactly to the number of swirl vanes. The swirl 
vanes were NACA turbine profiles operating at optimum con- 
ditions and were not stalled. The flow in the diffuser was 
steady, but the static pressure rise was only about one half of 
that expected from the inlet swirl and radius ratio even when 
accounting for wall friction. 
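
For orientation, the ideal pressure rise that bounds such a comparison from above can be sketched as follows. This is an illustrative estimate only, assuming incompressible, frictionless flow between parallel walls, so that both velocity components vary inversely with radius. The function name and the coefficient definition (static pressure rise divided by inlet dynamic pressure) are assumptions of this sketch, not quantities reported by the discussers, whose expected value also accounted for wall friction.

```python
def ideal_vaneless_diffuser_cp(radius_ratio):
    """Ideal static pressure-rise coefficient of a parallel-wall vaneless diffuser.

    Assumptions (not from the paper): incompressible, frictionless flow.
    Continuity gives V_r ~ 1/r and conservation of angular momentum gives
    V_theta ~ 1/r, so the velocity magnitude falls as 1/r for any swirl angle.
    """
    if radius_ratio < 1.0:
        raise ValueError("radius_ratio is outlet radius / inlet radius, must be >= 1")
    return 1.0 - 1.0 / radius_ratio**2


if __name__ == "__main__":
    # Radius ratio of about 2.0, as quoted for Isom's diffuser.
    print(ideal_vaneless_diffuser_cp(2.0))
```

Wall friction lowers the attainable value below this bound, and more so at high swirl because the flow path is longer.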

More recently, workers in the same laboratory have discovered 
that a rotating stall phenomenon can occur in vaneless diffusers 
for inlet angles greater than 70 deg from the radial direction. This 
phenomenon is probably related to the rotating disturbances ob- 
served by Deak, Maroti, and Kreith [2] between stationary and 
rotating disks. 

These difficulties so badly distort the flow in the vaneless diffuser,
offering a circumferentially varying flow velocity and angle
to the return vanes with the possibility of backflow, that consid- 
erable doubt must be cast upon the author’s experimental re- 
sults. Effecting measurements at vane row inlet and outlet 
along the same streamline is not sufficient since the various pas- 
sages will behave differently; some passages could be stalled or 
even flowing backward. 

Incidentally, the M.I.T. investigators have developed a simple
solution to this experimental difficulty which involves the use of 
rotating screens (or vanes) to impart swirl to the diffuser inlet 
flow. In that case, the wakes of the screen wires (or vanes) dis- 
appear rapidly and the flow has been proved with detailed probing 
to be axisymmetric. 

The very large change of loss coefficient with a 5-deg change in
inlet angle of the S100 compared to the S101 vanes further illustrates
the severe effect of inlet distortions. A practical question
of importance would be: “Is the S100 design so extremely sensitive
to inlet conditions as to severely complicate the design
problem for untested applications and would this sensitivity have
an adverse effect upon the surge limit of the stage?”

The author’s Figs. 22 through 26 suggest strongly that the 
crossover exit flow (at entry to the next stage) was stalled on the 
outer wall of the meridional passage. One would expect the 
velocity to be highest on the convex side of the bend and lowest 
on the concave side; however, all the author’s figures show that 
the converse was observed. Such large deviations from poten- 
tial flow must be due to separation in the bend or upstream there- 
of. The existence of separation at this point cannot be con- 
sidered lightly because of the very great potential damage to the 
performance of the following stage—a loss which may be of far 
greater consequence than that in the return crossover. 
Author's Closure


The author appreciates Messrs. Dean’s and McGahan’s in- 
terest in his paper. It now appears that the peripheral distortions 
observed are characteristic of radial flow with swirl produced by 
stationary vanes and it is indeed unfortunate that the author was 
not aware of this at the time the test setup was designed. 

It is agreed that taking measurements along the same stream- 
line does not guarantee significance to the results and the author 
apologizes for a naive statement. It was found in this investiga- 
tion, however, that if this was done, an acceptable correlation of 
loss coefficient could be obtained between tests in which the orien- 
tation and configuration of the distortion were varied. Not only 
could a consistency be obtained between tests, but for any given 
test, the loss coefficients calculated from measurements taken at 
the three peripherally spaced sets of instruments were in agreement,
in spite of the fact that each set of probes experienced
different regions of the distortion. The following representative
data demonstrate this point.

S100 vanes, prewhirl setting 20 deg
(tabulated values by peripheral station are not legible in the source)


This is, of course, a very surprising result, indicating that the 
distortion persists throughout the test section with almost no 
mixing taking place. It is perhaps pertinent that the flow decel- 
erates throughout the test section except in a small region at the 
discharge in the case of the S100-A and S101 models. 

In addition to this consistency, further substantiation was 
obtained from tests of two other models (one crossover design and 
one return channel design) which were not reported. These 
models exhibited considerable variation, not only in the diffuser 
vanes but also in the vanes producing swirl and the annular 
passages. Tests of these models gave the same comparative re- 
sults, the lowest loss coefficient of the crossover vanes being 
0.36 which for the return channel vane was 0.63. 

It is well that Messrs. Dean and McGahan re-emphasize that 
the sensitivity of the crossover vanes to angle of attack could have 
an adverse effect upon the operating range of a stage. In the 
second crossover design, mentioned in the foregoing, this sensitivity 
was reduced by a well-rounded, blunt leading edge. The loss coefficient
varied from 0.36 with a prewhirl setting of 20 deg to 0.44 with
a prewhirl setting of 15 deg. As pointed out in the text, however, the
major cause of sensitivity to flow rate may be the instability of 
the velocity profile in the vaneless diffuser. The potential ad- 
vantage of the crossover vanes may not be realized if stable and 
uniform conditions are not provided in the vaneless section of the 
diffuser. 

The downstream survey station (position F, Fig. 17) is located 
after the meridional turn has been completed. Thus, on a poten- 
tial basis, the flow would have returned to an approximately uni- 
form distribution (note that the potential flow becomes truly 
uniform at infinity). Because the velocities on the inner radius of 
the turn are higher than on the outer radius, however, the losses 
there will be higher. Thus downstream from a turn the lower 
velocities are generally found on the side of the turn extending 
from the inner radius. Studies with smoke and tufts showed no 
separation in the meridional turn at the crossover exit for the 
S101 and S100-A models.

Certainly, the distortion at this point, even without separation, 
will have a serious influence on the operation of the subsequent 
stage. It is doubtful, however, that more than moderate improve- 
ment can be obtained by further development of the crossover 
vanes alone. Consideration should be given, therefore, to devices 
for equalizing velocities after the turn such as annular splitters, or 
vortex generators on the inner radius of the turn. 
Isolated Surface Irregularities


The inception of cavitation on isolated surface irregularities imbedded in a turbulent
boundary layer is investigated experimentally and theoretically. Two families of
cylindrical roughness elements having constant cross sections are studied. One family
has a circular-arc cross section. The other family has a triangular cross section and
was selected to simulate the separating flow which is typical of an actual surface
irregularity. The theoretical minimum-pressure coefficient for the circular-arc
irregularities is determined as a function of the relative height of roughness for
several values of the boundary-layer shape parameter. Cavitation tests in the water
tunnels of the Ordnance Research Laboratory on roughness elements ranging from 0.002
to 0.5 in. in height indicate that the incipient-cavitation number of an isolated
surface irregularity is dependent upon the relative height of roughness, the
boundary-layer shape parameter, the velocity, and other variables as yet unknown.


Introduction 


Cavitation on a marine propeller or any similar device
can result in reduced efficiency, noise, and erosion or pitting
of the surface. One of the major problems in the design of hy- 
draulic equipment is the prediction of the onset of cavitation, so 
as to design for cavitation-free performance. 

Cavitation occurs when the minimum pressure on a body is
approximately equal to vapor pressure. Experience has shown
that if a body is sufficiently smooth the onset of cavitation can be
determined adequately from the theoretical value of the minimum
pressure. However, in some instances the surface is rough. The
roughness may be the randomly distributed roughness characteristic
of all machined surfaces or it may be isolated in nature such
as that produced by the poor mating of component parts or by
nicks and scratches. In any case, if a surface irregularity protrudes
a significant distance into the stream the minimum pressure
will be lowered below that for a smooth² surface; hence
cavitation will result and the body will not perform as designed.

ber 18, 1958. Paper No. 59—Hyd-12.

There has been very little systematic work in regard to the in- 
cipient-cavitation characteristics of surface irregularities in con- 
trast to the extensive investigations conducted to determine their 
drag characteristics. Shalnev [1]³ has investigated cavitation
caused by surface irregularities in the clearance between the end 
of turbomachinery blades and the casing wall. However, he did 
not correlate his results with the properties of the boundary layer 
in the vicinity of the roughness. He also has reported on exten- 
sive tests of cavitation erosion behind irregularities [2], but in 
these tests the emphasis was on the advanced stages of cavitation 
rather than the incipient state which is the major concern in this 
investigation. 

More recently Calehuff and Wislicenus [3] and Walker [4] have
conducted investigations of isolated surface irregularities in which
the cavitation characteristics were related to the relative height
of roughness $h/\delta$ where $h$ is the height of roughness and $\delta$ the
local boundary-layer thickness. To the author’s knowledge,
these investigations represent the only previous work on isolating
the effect of a surface irregularity. However, the investigations
conducted by Calehuff and Wislicenus were of limited scope and
exploratory in nature. Walker’s work, though of great theoretical
interest, was also of limited scope. He was primarily concerned
with the theoretical and experimental determination of the
minimum-pressure coefficient for one circular-arc irregularity at
one value of the relative height of roughness; namely, 0.25.

² In the field of hydrodynamics a smooth body is usually defined as
one which can sustain a laminar sublayer. However, it is not known
whether this definition is adequate for the incipient-cavitation
properties of surface irregularities.

³ Numbers in brackets designate References at end of paper.

Fig. 1 A surface irregularity on a flat plate

An investigation was conducted to determine the incipient- 
cavitation characteristics of surface irregularities over a wide 
range of conditions so as to extend considerably the present 
knowledge. This paper is a summary of an earlier report [5] on 
the investigation. The most general problem of randomly distributed
surface roughness is not considered here. A problem of
such complex nature would be extremely difficult to interpret 
without considerably more information on simpler flow struc- 
tures. 

This investigation is primarily confined to the study of isolated 
surface irregularities imbedded in the turbulent boundary layer 
of a flat plate. Two families of two-dimensional cylindrical 
roughness elements of constant cross section were investigated. 
The first family had a circular-arc cross section. The second
family had a triangular cross section and was selected to simulate
the separating flow which is typical of an actual surface irregularity.
The height of roughness ranged from 0.01 to 0.5 in. for the
circular-arc family and from 0.002 to 0.5 in. for the triangular
family. The relative height of roughness ranged from 0.014 to 
3.4 and the roughness Reynolds number from 370 to 300,000. 
Where it was feasible, theoretical methods were applied. The 
experimental program was conducted in three phases employing 
the three water tunnels of the Ordnance Research Laboratory of 
the Pennsylvania State University. 


General Considerations 


Consider the isolated surface irregularity imbedded in the turbulent
boundary layer of a flat plate shown in Fig. 1. The onset
of cavitation on such an irregularity is described by the
incipient-cavitation number $\sigma_i$, given by

$$\sigma_i = \frac{P_i - P_v}{\tfrac{1}{2}\rho V^2} \tag{1}$$

where $P_i$ is the free-stream static pressure at the state of incipient
cavitation, $V$ the free-stream velocity, $P_v$ the vapor pressure at
the bulk temperature, and $\rho$ the mass density of the liquid.

The minimum pressure in the fluid $P_{\min}$ is caused by the
irregularity. If the irregularity is very blunt, such as the triangular
elements used in this investigation, the minimum pressure
will occur in the wake of the irregularity. On the other hand, for
a streamlined element such as a circular arc, the minimum pressure
occurs on the surface. In any case, one can define a
minimum-pressure coefficient $C_{p\min}$ given by

$$C_{p\min} = \frac{P - P_{\min}}{\tfrac{1}{2}\rho V^2} \tag{2}$$

where $P$ is the free-stream static pressure. The minimum-pressure
coefficient is taken as a measure of the minimum pressure in
the noncavitating-flow regime.

In the field of cavitation it is usually assumed that at the state 
of incipient cavitation the minimum pressure is equal to vapor 
pressure. Furthermore, it is assumed that small vapor bubbles do 
not alter materially the pressure field from that obtained for the 
noncavitating-flow regime. From these assumptions it follows 
that 


$$\sigma_i = C_{p\min} \tag{3}$$


Equation (3) is a statement of the fundamental theory of incipient 
cavitation, and it states that the incipient-cavitation number is 
equal to the minimum-pressure coefficient for the noncavitating- 
flow regime. 
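
The two definitions behind the fundamental theory can be sketched numerically. The snippet below is a minimal illustration, not part of the original analysis: the function names and the sample water properties are assumptions chosen for the example. It evaluates the incipient-cavitation number as (P_i - P_v) divided by the dynamic pressure, and the minimum-pressure coefficient as (P - P_min) divided by the same dynamic pressure, and shows that they coincide when the minimum pressure equals vapor pressure at inception.

```python
def incipient_cavitation_number(p_i, p_v, rho, v):
    """sigma_i = (P_i - P_v) / (0.5 * rho * V**2), consistent units assumed."""
    return (p_i - p_v) / (0.5 * rho * v * v)


def min_pressure_coefficient(p, p_min, rho, v):
    """C_pmin = (P - P_min) / (0.5 * rho * V**2), consistent units assumed."""
    return (p - p_min) / (0.5 * rho * v * v)


if __name__ == "__main__":
    # Illustrative values only (SI units), not data from the paper.
    rho = 998.0      # kg/m^3, water near room temperature
    p_v = 2339.0     # Pa, vapor pressure near room temperature
    v = 15.0         # m/s, free-stream velocity
    p_i = 80000.0    # Pa, free-stream static pressure at inception

    sigma_i = incipient_cavitation_number(p_i, p_v, rho, v)
    # Fundamental theory: at inception the minimum pressure equals P_v,
    # so evaluating C_pmin with P = P_i and P_min = P_v gives sigma_i.
    cp_min = min_pressure_coefficient(p_i, p_v, rho, v)
    print(sigma_i, cp_min)
```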

In many instances the fundamental theory of incipient cavita- 
tion has proved to be correct. However, in recent years extensive 
experiments [6] on bodies in a uniform stream have indicated 
that the minimum pressure is often less than vapor pressure at the 
state of incipient cavitation. Thus in some cases 


$$\sigma_i < C_{p\min} \tag{4}$$


It is thought [7] that the departures from vapor pressure are due
to so-called “time effects” associated with the growth of a vapor
bubble to macroscopic size. Furthermore, it has been found that
the difference between $C_{p\min}$ and $\sigma_i$ depends upon the scales of
length and velocity of the system [6]. Because of this velocity
and length dependency, departures from the fundamental theory
of incipient cavitation are said to be due to so-called “scale
effects.”

Fig. 2 Flow picture for frozen-streamline theory

Nomenclature

$R_h$ = roughness Reynolds number
$T$ = water temperature
$u$, $V$ = velocity in rotational and irrotational fields, respectively
$u_h$ = velocity at height of roughness
(symbol illegible) = free-stream velocity
$X$, $Y$ = co-ordinates
$Z$ = complex variable
$\alpha$ = air content
$\alpha_s$ = saturated air content
$\alpha/\alpha_s$ = relative air content
$\delta$, $\delta_\infty$ = boundary-layer thickness
(symbol illegible) = distance from origin to edge of rotational field
$\delta^*$ = displacement thickness
$\zeta$ = complex variable
$\theta$ = momentum thickness
$\lambda$ = thickness ratio for circular-arc irregularities
$\nu$ = kinematic viscosity
$\rho$ = mass density
$\sigma_i$ = incipient-cavitation number of a surface irregularity
$\sigma_{i\infty}$ = incipient-cavitation number of a surface irregularity when $h/\delta = \infty$
$\sigma_i$ (subscript illegible) = incipient-cavitation number of a smooth parent body
$\sigma_i$ (subscript illegible) = incipient-cavitation number of a rough parent body

Subscripts for Frozen-Streamline Theory

0 = conditions along vertical line that passes through mid-point of irregularity
$\infty$ = conditions far upstream
$\delta$ = conditions at edge of boundary layer for $X = 0$
$h$ = conditions at $X = 0$, $Y = h$

The quantity $C_{p\min} - \sigma_i$ may be taken as a measure of the
scale effects, for $C_{p\min} - \sigma_i = 0$ implies that there are no scale
effects whereas $C_{p\min} - \sigma_i > 0$ implies there are scale effects.⁴
Consequently, one must know both $C_{p\min}$ and $\sigma_i$ in order to
measure the scale effects. Furthermore, the minimum-pressure
coefficient is subject to its own peculiar scale effects. The
minimum-pressure coefficient for a body in a uniform stream is known
to be some function of Reynolds number. Therefore, it is convenient
to divide the problem of scale effects into two parts: (a)
Those scale effects which cause the minimum pressure in the
noncavitating-flow regime to vary; and (b) those scale effects
which cause the minimum pressure at the state of incipient cavitation
to be less than vapor pressure. The former will be referred to
as “scale effects on the minimum pressure” and the latter are
called “scale effects on the cavitation pressure” where the
“cavitation pressure” is defined as the minimum pressure at the state of
incipient cavitation. It is evident from the foregoing definition
that the cavitation pressure is given by

$$\text{cavitation pressure} = P_v - \tfrac{1}{2}\rho V^2\,(C_{p\min} - \sigma_i)$$
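
The relation for the cavitation pressure follows from the two definitions: the minimum pressure at inception is the free-stream pressure at inception less the minimum-pressure coefficient times the dynamic pressure, and the free-stream pressure at inception is the vapor pressure plus the incipient-cavitation number times the dynamic pressure. A minimal sketch, with hypothetical function names and illustrative inputs that are not data from the paper:

```python
def scale_effect(cp_min, sigma_i):
    """Measure of scale effects: zero means none, positive means present."""
    return cp_min - sigma_i


def cavitation_pressure(p_v, rho, v, cp_min, sigma_i):
    """Minimum pressure at inception: P_v - 0.5*rho*V**2*(C_pmin - sigma_i)."""
    return p_v - 0.5 * rho * v * v * scale_effect(cp_min, sigma_i)


if __name__ == "__main__":
    # Illustrative values only (SI units).
    rho, v, p_v = 998.0, 15.0, 2339.0
    # No scale effect: the cavitation pressure equals the vapor pressure.
    print(cavitation_pressure(p_v, rho, v, cp_min=0.70, sigma_i=0.70))
    # Positive scale effect: the cavitation pressure falls below vapor pressure.
    print(cavitation_pressure(p_v, rho, v, cp_min=0.70, sigma_i=0.69))
```

A negative result in the second case would indicate that the liquid sustains tension at inception for those inputs.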


Although in many instances one can theoretically determine
$C_{p\min}$ for a body in a uniform stream there are no adequate
theoretical means of predicting $\sigma_i$ for such flows. The problem is
further complicated in the case considered here where the bodies
are placed in a nonuniform flow. No attempt will be made to
predict $\sigma_i$ for this case, but an approximate theoretical determination
of $C_{p\min}$ for the circular-arc irregularities is presented in the
following section. Before proceeding with this development a few
remarks are in order concerning what variables might describe
the minimum-pressure coefficient for a surface irregularity.

Consider the irregularity shown in Fig. 1. One would expect
that the pressure field would be dependent upon the location of
the element in the boundary layer. Consequently, the relative
height of roughness $h/\delta$, where $h$ is the height of roughness and
$\delta$ the local boundary-layer thickness, should be an important
variable. Also, the shape of the boundary-layer velocity profile
would influence the pressure field. The boundary-layer shape
parameter $H = \delta^*/\theta$, where $\delta^*$ is the displacement thickness and
$\theta$ the momentum thickness, will be used to describe the shape of
the velocity profile. Since the pressure field may be influenced
by the state of flow in the boundary layer, a Reynolds number
describing the motion should be employed. The roughness
Reynolds number $R_h$ is selected for this purpose. This
parameter is given by $R_h = u_h h/\nu$, where $u_h$ is the velocity in the
boundary layer at the height of roughness and $\nu$ the kinematic
viscosity. Then the minimum-pressure coefficient may be represented
as

$$C_{p\min} = f(h/\delta,\ H,\ R_h) \tag{5}$$

⁴ It should be noted that only vaporous cavitation, which is cavitation
caused by the vaporization of the liquid, is considered in this
criterion. Gaseous cavitation due to slow expansion of the bubbles by
air diffusion can occur at pressures greater than vapor pressure, in
which case $C_{p\min} - \sigma_i < 0$.
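
The three arguments of Equation (5) can be evaluated for a given test condition as sketched below. This is an illustration under stated assumptions, not the author's procedure: the velocity at the height of roughness is taken from a power-law profile with exponent 1/m inside the boundary layer (and set to the edge velocity when the element protrudes beyond it), the shape parameter is taken as (2 + m)/m for that profile, and the function name and sample inputs are hypothetical.

```python
def roughness_parameters(h, delta, v_edge, m, nu):
    """Return (h/delta, H, R_h) for a power-law boundary layer.

    h      : height of roughness
    delta  : local boundary-layer thickness (same length unit as h)
    v_edge : velocity at the edge of the boundary layer
    m      : reciprocal of the power-law exponent (assumed profile)
    nu     : kinematic viscosity (units consistent with h and v_edge)
    """
    relative_height = h / delta
    # Assumed profile: u/V = (Y/delta)**(1/m) for Y <= delta, u = V beyond.
    u_h = v_edge * min(relative_height, 1.0) ** (1.0 / m)
    shape_parameter = (2.0 + m) / m
    roughness_reynolds = u_h * h / nu
    return relative_height, shape_parameter, roughness_reynolds


if __name__ == "__main__":
    # Illustrative inputs in ft and ft/s; nu for water is roughly 1e-5 ft^2/s.
    print(roughness_parameters(h=0.01 / 12, delta=0.45 / 12,
                               v_edge=40.0, m=7.0, nu=1.0e-5))
```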


Theoretical Determination of $C_{p\min}$ for Circular-Arc
Irregularities


The minimum-pressure coefficient for circular-arc irregularities
having a thickness ratio of $\lambda = 0.175$, where $\lambda$ is the ratio of the
height of the irregularity to chord length, was determined by
means of the frozen-streamline theory suggested by G. F. Wislicenus.⁵
Walker [4] employed this theory in several cases in
addition to the more exact relaxation method and found acceptable
agreement between these two methods and pressure
measurements.

The flow picture for the frozen-streamline theory is shown in 
Fig. 2. A wall is made to coincide with the stagnation stream- 
line of an infinite cylinder having a circular-arc cross section. A
nonuniform flow is assumed to exist far upstream. The fluid is 
nonviscous and incompressible. It is assumed that the form of 
the streamlines is unchanged from that of the uniform, potential 
flow about the cylinder. Because of this latter assumption, the 
theory is called the “frozen-streamline theory.” 

The flow is irrotational outside the boundary layer and rota- 
tional from the wall to the edge of the boundary layer. As shown 
in Fig. 2, the streamline through the point $Y_\infty = \delta_\infty$ is the locus
of boundary-layer thicknesses. The minimum pressure occurs at
$X = 0$, $Y = h$.

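The upstream velocity profile in the boundary layer is of the
form

$$\frac{u_\infty}{V_\infty} = \left(\frac{Y_\infty}{\delta_\infty}\right)^{1/m} \tag{6}$$

where $Y_\infty$ is the distance from the wall far upstream from the
irregularity, $\delta_\infty$ the boundary-layer thickness at the upstream
point, $u_\infty$ the velocity at $Y_\infty$, and $V_\infty$ the velocity at $\delta_\infty$. The
boundary-layer shape parameter $H = \delta^*/\theta$ for the velocity profile
given by Equation (6) is found to be

$$H = \frac{2 + m}{m} \tag{7}$$

The expressions for $\delta^*$ and $\theta$ are

$$\delta^* = \int_0^{\delta_\infty} \left(1 - \frac{u_\infty}{V_\infty}\right) dY_\infty \tag{8}$$

and

$$\theta = \int_0^{\delta_\infty} \frac{u_\infty}{V_\infty}\left(1 - \frac{u_\infty}{V_\infty}\right) dY_\infty \tag{9}$$

⁵ Director of the Garfield Thomas Water Tunnel and Professor of

Fig. 3 Normalized minimum-pressure coefficient as a function of $h/\delta_\infty$ ($\lambda = 0.175$)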
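
The shape parameter of Equation (7) can be checked by integrating the displacement and momentum thicknesses numerically for a power-law profile with exponent 1/m. The sketch below is only a verification aid with hypothetical function names; it works with the dimensionless distance from the wall (distance divided by boundary-layer thickness) and uses a simple midpoint rule.

```python
def shape_parameter_numeric(m, n=20000):
    """H = delta*/theta for u/V = eta**(1/m), integrated over 0 <= eta <= 1."""
    step = 1.0 / n
    displacement = 0.0  # delta* / delta
    momentum = 0.0      # theta / delta
    for k in range(n):
        eta = (k + 0.5) * step          # midpoint of the k-th strip
        ratio = eta ** (1.0 / m)        # u / V from the power law
        displacement += (1.0 - ratio) * step
        momentum += ratio * (1.0 - ratio) * step
    return displacement / momentum


def shape_parameter_closed_form(m):
    """Closed form quoted in the text: H = (2 + m) / m."""
    return (2.0 + m) / m


if __name__ == "__main__":
    for m in (5.0, 7.0, 9.0):
        print(m, shape_parameter_numeric(m), shape_parameter_closed_form(m))
```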


The expression for $C_{p\min}$ is derived in the Appendix and the results
of this analysis are given in Fig. 3 where the normalized
minimum-pressure coefficient $C_{p\min}/C_{p\min\infty}$ is shown as a function
of the relative height of roughness for several values of $m$.
The coefficient $C_{p\min\infty}$ is that obtained from the potential solution;
that is, when $h/\delta_\infty = \infty$. For a thickness ratio of 0.175,
$C_{p\min\infty}$ is 1.083.


Variation of $C_{p\min}$ With Reynolds Number


The minimum-pressure coefficient of a surface irregularity can 
be expected to vary with Reynolds number. This problem is ap- 
proached by considering the variation of Cpmin with Reynolds 
number for a circular cylinder and for a sphere in a uniform 
stream. Certain qualitative results are suggested by this analysis 
which may be applicable to surface irregularities. 

For a body in a uniform stream as the Reynolds number $R_d$
decreases, experimental results show that $C_{p\min}$ also decreases.
This result is explained by the fact that as the Reynolds number
decreases the boundary layer becomes thicker, causing an effective
increase in the radius of curvature with a resulting increase
in pressure on the surface. For example, the minimum-pressure
coefficient for a circular cylinder decreases from 2.8 at $R_d$ = 10° to
0.9 at $R_d$ = 28.

The flow about a sufficiently small irregularity buried in a
laminar boundary layer, or perhaps in the laminar sublayer of a
turbulent boundary layer, falls into the class of so-called “creeping
motions” in which the viscous terms in the Navier-Stokes’ equations
become quite predominant. The question arises as to
whether $C_{p\min}$ continues to decrease as Reynolds number decreases.
In order to throw some light on this question, the minimum-pressure
coefficients for cylinders and spheres in a uniform
stream at very low Reynolds numbers were derived employing the
approximate theories of Stokes and Oseen [8,9]. From Stokes’
solution it is found that for a sphere


$$C_{p\min} = \frac{6}{R_d} \tag{10}$$

where $R_d$ is Reynolds number given by

$$R_d = \frac{Vd}{\nu} \tag{11}$$

and $V$ is the free-stream velocity, $d$ the diameter, and $\nu$ the
kinematic viscosity. Oseen’s solution for the sphere is

$$C_{p\min} = 1 + \frac{6}{R_d} \tag{12}$$

A comparison of Equation (12) with Equation (10) shows that 
Oseen’s solution for Cpmin is greater than Stokes’ solution by 1. 
The application of Stokes’ method to the flow about an infinite 
cylinder does not give a physically meaningful solution, for the 
velocity does not approach the free-stream velocity at infinity, 
but is rather logarithmically infinite. However, Lamb [10] has 
applied Oseen’s method to a circular cylinder successfully. From 
reference [10] (page 615), the expression for the minimum-pres- 
sure coefficient is found to be 
$C_{p\min}$ = (expression partly illegible in the source; the legible term is $2.003 - \log_e R_d$) (13)


Equations (10), (12), and (13) show a common characteristic;
namely, that as the Reynolds number decreases $C_{p\min}$ increases.⁶
This is just opposite to the variation typical of higher Reynolds
numbers. Thus the question arises as to whether or not the increase
of $C_{p\min}$ with a decrease in $R_d$ is a valid result since the
theories are approximate ones. In answer to this question, exact
pressure distributions for a circular cylinder at Reynolds numbers
of 1, 10, 100, and 1000 were obtained from reference [12]. The
pressure distributions given in reference [12] were obtained from
the full Navier-Stokes’ equations by relaxation methods. From
Fig. 7 of reference [12] (page 136), the minimum-pressure coefficients
are 3.1, 1.3, 1.1, and 1.5 for $R_d$ equal to 1, 10, 100, and
1000, respectively.
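
The low-Reynolds-number trend can be illustrated with the two sphere results, Stokes (6 divided by the Reynolds number) and Oseen (one plus that quantity), alongside the four exact cylinder values quoted above from reference [12]. The sketch is illustrative only; the function and variable names are hypothetical, and the approximate formulas are meaningful only at low Reynolds numbers.

```python
def cp_min_sphere_stokes(reynolds):
    """Stokes' solution for a sphere: C_pmin = 6 / R_d."""
    return 6.0 / reynolds


def cp_min_sphere_oseen(reynolds):
    """Oseen's solution for a sphere: C_pmin = 1 + 6 / R_d."""
    return 1.0 + 6.0 / reynolds


# Exact cylinder values quoted in the text from reference [12]: R_d -> C_pmin.
CYLINDER_EXACT = {1: 3.1, 10: 1.3, 100: 1.1, 1000: 1.5}


def reynolds_of_smallest_cp(table):
    """Reynolds number at which the tabulated C_pmin is smallest."""
    return min(table, key=table.get)


if __name__ == "__main__":
    for reynolds in (0.1, 0.5, 1.0):
        print(reynolds, cp_min_sphere_stokes(reynolds), cp_min_sphere_oseen(reynolds))
    print(reynolds_of_smallest_cp(CYLINDER_EXACT))
```

Among the four tabulated points the smallest coefficient falls at a Reynolds number of 100, with larger values on either side, which is the reversal of trend discussed in the text.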

The minimum-pressure coefficients just presented together with 
the exact values and various experimental values obtained from 
references [13] through [16] are given in Fig. 4. The experimental 
data are shown by solid curves while the foregoing theoretical 
values are given by dashed lines. The exact solutions are shown 
by small circles. 

Referring to the $C_{p\min}$ for a cylinder in Fig. 4 it is seen that indeed
the exact solutions show an increase in $C_{p\min}$ for $R_d < 100$
and tend to approach Lamb's solution for $R_d = 1$. The minimum-pressure
coefficient for a sphere in Fig. 4 shows characteristics
similar to the cylinder although exact solutions were not available
for the lower Reynolds numbers.

The results shown in Fig. 4 are particularly interesting from the
standpoint of the incipient-cavitation characteristics of surface
irregularities.⁷ These results suggest that even when the relative
height of roughness is very small, the local Reynolds number may
be such as to cause significant pressure reductions in the vicinity
of a surface irregularity.

⁶ The quoted range of applicability of the corresponding drag coefficients
for these equations is $R_d < 1$ for Stokes’ and Lamb’s solutions
and $R_d < 4$ for Oseen’s solution [11].

⁷ The results in Fig. 4 are probably typical of minimum surface-pressure
variations but whether or not the minimum pressure in the
core of a vortex would vary in a similar manner is open to question.

Fig. 4 Spectrum of $C_{p\min}$ for cylinders and spheres


Description of Experiments 


The experimental program was conducted in three phases. 
Phases I and II were conducted in the 48-in. and 12-in. water 
tunnels, and Phase III was conducted in the 4.5-in. by 20-in. 
water tunnel. Descriptions of the apparatus are given in Figs. 5 
through 7. 

Phases I and II were similar in many respects. Circular-arc
and triangular irregularities were imbedded in a turbulent 
boundary layer at several stations along a flat plate installed at 
the central plane of the test sections. In these phases the height 
of roughness ranged from 0.004 to 0.5 in. In Phase III, a 0.002- 
in. triangular element was located at the mid-chord of a 5-in. 
NACA 16012 hydrofoil. A view of the hydrofoil with the ir- 
regularity at mid-chord is shown in Fig. 7. 

A schematic drawing of the Phase I and II apparatus is shown 
at the top of Fig. 5. A sketch of the cross sections of the two 
families of irregularities is shown at the bottom of Fig. 5. In 
Phase II, the irregularities spanned the entire width of the plate 
and were machined on strips which were inserted in slots in the 
plate. The irregularities employed in Phase I were bolted to 
the plate and did not span the entire width of the plate because 
of mechanical difficulties. The span of the elements employed in
Phase I were 6 in., 13 in., and 26 in. for the '/,.-in., '/«in., and 
'/-in. roughness elements, respectively. The spanwise ends of 
these elements were rounded off to avoid possible end effects. 

All of the irregularities were made of brass as was the flat plate 
utilized in Phase II. An aluminum plate was used in Phase I. 
In order to insure that the leading edge of the plate did not 
cavitate and thereby alter the boundary-layer growth, the leading 
edge was hand-finished to an airfoil shape. The cross sections of 
the leading edges were NACA 16010 and NACA 16005 for the 
Phase I and Phase II plates, respectively. The cross sections of 
the trailing edges were NACA 160075 for Phase I and NACA 
16005 for Phase II. 

An end view of the Phase I apparatus looking upstream is
shown in Fig. 6. This view corresponds to the end view shown
schematically in Fig. 5. In Fig. 6 roughness elements are
shown at the three measuring stations for illustration only and
were not tested in this arrangement, for all tests were confined to
a single roughness element.

Fig. 6 End view of Phase I apparatus looking upstream


The static pressure was measured at numerous points along the 
plates in Phases I and II. Static-pressure holes drilled in the 
surface of the plate were connected to manometer boards by 
means of plastic tubing brought out of the plate through the strut 
shown in Figs. 5 and 6. Pressures were measured at velocities 
from 20 to 60 fps in Phase I and from 20 to 70 fps in Phase II. 
Mechanical considerations prevented the Phase I tests from being 
conducted for velocities above 60 fps. Mean values of pressure-drop
coefficients were obtained from numerous readings, and
were used in correcting data and in conjunction with boundary-layer
traverses.

Fig. 5 Description of Phases I and II apparatus

Fig. 7 Phase III hydrofoil (NACA 16012) with 2/1000-in. triangular
roughness

In Phases I and II, boundary-layer traverses were conducted at 
the measuring stations along the center line of the plate. Three 
stations were employed in Phase I and two were used in Phase II. 
The axial location of these stations X is shown in Fig. 5. The 
uniformity of the boundary layer across the plate was checked at 
the rear station in Phase I and was found to be quite good. 

The relative velocity u/V, where u is the velocity a distance Y
from the plate and V is the velocity at $\delta$, was plotted as a function
of Y on log-log paper. A straight line was fitted to the data for
points within the turbulent boundary layer, and another straight
line was drawn through the data points outside of the boundary
layer. The intersection of these lines determined the boundary-layer
thickness $\delta$. In most cases, it was found that
the boundary-layer thickness corresponded to that value of Y for
which u/V = 0.995.
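The graphical construction described above can be sketched numerically. The following is a minimal illustration, not the procedure actually used in the tests: it fits least-squares lines to log-log data inside and outside the layer and intersects them. The function names and the synthetic traverse (a 1/7-power profile with a thickness of 0.45 in.) are assumptions made only for the example.

```python
import math


def fit_line(xs, ys):
    """Least-squares straight line y = a + b*x; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def boundary_layer_thickness(inner, outer):
    """Intersect the two log-log lines.

    inner, outer: lists of (Y, u/V) points taken inside and outside
    the boundary layer. Returns (thickness, inner slope), where the
    inner slope is the power-law exponent 1/m.
    """
    a1, b1 = fit_line([math.log10(y) for y, _ in inner],
                      [math.log10(r) for _, r in inner])
    a2, b2 = fit_line([math.log10(y) for y, _ in outer],
                      [math.log10(r) for _, r in outer])
    log_delta = (a2 - a1) / (b1 - b2)  # where a1 + b1*x equals a2 + b2*x
    return 10.0 ** log_delta, b1


# Hypothetical traverse: 1/7-power profile, thickness 0.45 in.
DELTA_TRUE, M = 0.45, 7.0
inner_pts = [(y, (y / DELTA_TRUE) ** (1.0 / M)) for y in (0.05, 0.1, 0.2, 0.3, 0.4)]
outer_pts = [(y, 1.0) for y in (0.5, 0.6, 0.8)]
delta, exponent = boundary_layer_thickness(inner_pts, outer_pts)
print(f"delta = {delta:.3f} in., 1/m = {exponent:.3f}")
```

With real traverse data the choice of which points count as "inside" the layer is a judgment call, as it was on log-log paper.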

In general, the boundary layer behaved like that typical of a
rough plate, that is, the boundary-layer thickness did not vary
with velocity. However, at the forward station in Phase I, the 
boundary-layer thickness at 20 fps was smaller than that at 30-60 
fps, indicating a transition from smooth to rough plate conditions. 
At the center and rear stations in Phase I, the boundary-layer 
thickness did not show so great a variation from 20 to 30 fps. 
Because of the possible uncertainties in the boundary-layer thick- 
ness around 20 fps, most of the cavitation tests were conducted 
from 30 to 60 fps. 

The results of the boundary-layer traverses are summarized in 
Table 1 and several examples of traverses are shown in Fig. 8. 
Referring to Table 1, the mean value of the boundary-layer thickness
$\bar{\delta}$⁸ is given for each measuring station. The actual coefficient
of variation is shown in each case. It compares favorably with the
predicted coefficient of variation, which was determined from a
measure of the uncertainties in zeroing the total-head tube on the
surface of the plate. The power-law exponent 1/m is also given
in Table 1 together with the shape parameter H calculated from
Equation (7).

Table 1 Summary of boundary-layer data, Phases I and II

Column headings: mean value of boundary-layer thickness $\bar{\delta}$, in.; predicted variation, per cent
Phase I: 0.124, 0.147, 0.450, 0.453, 0.990*, 0.994
Phase II: Forward, Rear
* One test.
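The link between the power-law exponent 1/m and the shape parameter H can be illustrated as follows. This sketch assumes that Equation (7), which is not reproduced in this section, is the usual result for a power-law profile u/V = (Y/δ)^(1/m), namely H = 1 + 2/m; the function names are illustrative only.

```python
def shape_parameter(m):
    """H = (displacement thickness) / (momentum thickness) for u/V = (Y/delta)**(1/m)."""
    displacement = 1.0 / (m + 1.0)           # delta*/delta
    momentum = m / ((m + 1.0) * (m + 2.0))   # theta/delta
    return displacement / momentum           # reduces to 1 + 2/m


def exponent_from_shape(h_shape):
    """Invert H = 1 + 2/m to recover m."""
    return 2.0 / (h_shape - 1.0)


# Shape parameters quoted in the text for Fig. 12 and the Phase I rear station.
for h_shape in (1.33, 1.29):
    print(f"H = {h_shape:.2f} -> m = {exponent_from_shape(h_shape):.2f}")
```

Under this assumption the quoted H values correspond to m between roughly 6 and 7, that is, close to the familiar 1/7-power profile.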

Following the plate calibrations, cavitation tests were performed.
In Phase I, a series of check tests was conducted at the
rear station to determine the effect of variations in dissolved air
content $\alpha$ on the onset of cavitation. These tests were conducted
at air contents from 5 to 15 ppm.⁹ The results indicated no
significant variation in $\sigma_i$ with $\alpha$ for the circular-arc irregularities,
but the $\sigma_i$ for the triangular family tended to increase with $\alpha$.
The Phase I cavitation data discussed in subsequent sections
were those obtained at $\alpha$ = 5 ppm. At this air content, the air
content relative to saturation $\alpha/\alpha_s$, where $\alpha_s$ is the saturated-air
content at the inception pressure, was less than unity in most
cases, which prevented any great amount of entrained
bubble growth. In Phase II, however, it was felt unnecessary
to run at low air contents since even at high values of $\alpha$
the water was fairly clear. Apparently this was due to the
shorter time available for bubble growth by diffusion in the smaller
tunnel. Most of the Phase II tests were conducted at $\alpha$ = 12
ppm.


⁸ A bar over a symbol indicates a mean value.
⁹ Moles of air per million moles of water.
In all tests, the onset of cavitation was defined as the disappearance
of cavitation; that is, the velocity was held constant
and the pressure increased, causing the zone of cavitation to disappear.
In Phase I, inception was determined both visually and
acoustically. The visual method gave more repeatable results
and was found to agree closely with the acoustic method for the
circular-arc elements. However, the acoustic definition gave
smaller values of $\sigma_i$ than did the visual method for the triangular
elements. The cavitation data presented in subsequent sections
were obtained by visual methods.

From 5 to 10 determinations of the state of inception were
made at each condition. From these data a mean value of the incipient-cavitation
number $\bar{\sigma}_i$ was obtained. Only the mean
values are used in subsequent discussions. In general, the variation
from the mean was about ±3 per cent.

Pressure distributions on the 1/4-in. and 1/2-in. circular-arc
elements employed in Phase I were measured to see how closely
$C_{p\min}$ approximated $\bar{\sigma}_i$. Because of mechanical difficulties it was
not possible to measure the pressure distribution on the smaller
circular-arc elements. Pressure measurements were not made on
the triangular elements since the minimum pressure occurs in the
wake, and consequently the minimum surface pressure has no
bearing on the onset of cavitation.


Discussion of Results 


The results of the investigation are summarized in Figs. 9
through 14. The measured values of $\bar{\sigma}_i$ for the circular-arc elements
are compared to measured values of $C_{p\min}$ in Fig. 9. In
Figs. 10 through 13 the incipient-cavitation number for both
families of irregularities is shown as a function of the relative
height of roughness for constant values of the boundary-layer
shape parameter. Cross plots of the curves of Figs. 10 through 13
are shown in Fig. 14, where $\bar{\sigma}_i$ is shown as a function of H for
constant values of $h/\delta$.

Referring to the data for the 1/4-in. circular arc in Fig. 9, it is
seen that the minimum-pressure coefficient and incipient-cavitation
number are in fair agreement, indicating that there were no
scale effects on the cavitation pressure. In general, the state of
incipient cavitation on the circular-arc elements was characterized
by small macroscopic bubbles which occurred near the top of the
elements at the point of symmetry. The disappearance of cavitation
across the span was fairly uniform in most cases. During
the cavitation tests of the 1/2-in. element at the forward station
there was some indication of intermittent separation. White
patches of cavitation could be seen near the top of the element in
some instances. If separation did occur then the minimum pressure
was in the fluid rather than on the surface. In general, for
separating flows it is well known that the incipient-cavitation
number is greater than the minimum-pressure coefficient based
on the minimum surface pressure. This may account for the fact
that $\bar{\sigma}_i$ is greater than $C_{p\min}$ for the 1/2-in. element at the forward
and center stations as shown by the data to the right in Fig. 9.
On the other hand, it must be realized that the accuracy of a
minimum-pressure coefficient determined from a pressure-distribution
measurement is quite dependent upon how close a static
hole may come to the actual pressure minimum. In general, this
would cause the minimum-pressure coefficient to be smaller than
the actual value. Hence this fact could also account for the discrepancy
between the incipient-cavitation number and the minimum-pressure
coefficient.

The incipient-cavitation number is shown as a function of
relative height of roughness in Figs. 10 and 11 for the circular arcs
and in Figs. 12 and 13 for the triangles. The Phase I and II data
in each figure are for a constant value of the boundary-layer shape 
parameter H. The data for the rear station in Phase I are not 
used in Figs. 10 through 13 since there was no comparable value 
of H encountered in Phase II. However, some of the data ob- 
tained at the rear station in Phase I are employed in Fig. 14. 

First of all, comparisons will be made between the triangles and
circular arcs for Phases I and II; Phase III will be discussed
subsequently. The data for the triangles, Figs. 12 and 13, are
similar in many respects to the data for the circular arcs, Figs. 10
and 11. It is seen that for both families of irregularities the incipient-cavitation
number increases with an increase in the relative
height of roughness. Also there is a decided velocity-scale
effect, for in most instances the incipient-cavitation number increases
with velocity. This type of scale effect has also been
noted for bodies in a uniform stream [6, 17].

It is to be noted that the variations of $\bar{\sigma}_i$ with velocity for the
large elements are smaller than those for the small roughnesses.
This suggests a size effect. However, for a given value of the 
shape parameter, the relative height of roughness was varied by 
changing the height of roughness. Thus it is not possible to 
separate entirely the effects of size from those due to variations 
in the relative height of roughness. It is thus not known whether 
the roughness Reynolds number is a significant parameter de- 
scribing the incipient-cavitation properties of surface irregulari- 
ties. 

The influence of variations in H on the incipient-cavitation
number is shown in Fig. 14, which is a cross plot of the data of
Figs. 10 through 13. In addition, data for the rear station of
Phase I, H = 1.29, are given in Fig. 14. It is seen that for a
given value of relative height of roughness, the incipient-cavitation
number decreases with an increase in the boundary-layer
shape parameter. The theoretical values of $C_{p\min}$ as obtained
from the frozen-streamline theory are compared to the incipient-cavitation
number for the circular-arc irregularities.

The theoretical values of $C_{p\min}$ for the circular-arc irregularities
in Figs. 10 and 11 are from 20 to 40 per cent greater than the incipient-cavitation
number at 60 fps. Also a comparison of the
theoretical curves of Fig. 3 with the measured values of $C_{p\min}$ in
Fig. 9 shows that the theoretical values are about 25 per cent
greater than the measured values in most cases. Aside from
other limitations, the frozen-streamline theory does not account
for possible scale effects due to viscosity. In addition, this theory
cannot be expected to predict the incipient-cavitation number if
there are significant scale effects on the cavitation pressure.

In contrast to the circular arcs, the onset of cavitation occurred
in the wake of the triangular elements, and was very sudden and 
explosive in nature, much like that described by Kermeen and 
Parkin [17] in their studies of sharp-edged disks in a uniform 
stream. There are no adequate theories for predicting Cpmin for 
such flows. However, a semiempirical analysis was made of the 
triangular elements and is discussed briefly in the section on 
Application of Results. 

Fig. 12 Variation of $\bar{\sigma}_i$ with $h/\delta$ for H = 1.33, triangles, Phases I and II

The Phase III data shown by the cross marks are compared to
Phases I and II in Figs. 12 and 13. Since the 2/1000-in. triangular
element was located in the pressure field of the hydrofoil it was
necessary to calculate the $\sigma_i$ for the roughness when considered
alone, that is when on a flat plate, by means of the equation

$$\sigma_i = \frac{\sigma_{iR} - C_{px}}{1 + C_{px}} \tag{14}$$

where $\sigma_{iR}$ was the observed incipient-cavitation number of the
hydrofoil. This equation is discussed in the section on Application
of Results. At the midchord of the NACA 16012 hydrofoil
$C_{px}$ = 0.286 from reference [18]. Also, it was necessary to
estimate the boundary-layer thickness in the region of the roughness.
This was obtained from Equation (23) discussed in Application
of Results.
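As a minimal sketch of how Equation (14) is applied, the following converts an observed incipient-cavitation number of the hydrofoil into that of the roughness considered alone, and back again through the inverse relation. The local pressure coefficient 0.286 is the value quoted in the text; the observed value 0.80 and the function names are hypothetical and serve only to exercise the formulas.

```python
def roughness_alone(sigma_ir, cp_x):
    """Equation (14): cavitation number of the roughness as if on a flat plate."""
    return (sigma_ir - cp_x) / (1.0 + cp_x)


def body_with_roughness(sigma_i, cp_x):
    """Inverse relation: cavitation number of the parent body with roughness."""
    return cp_x + (1.0 + cp_x) * sigma_i


CP_X_MIDCHORD = 0.286        # NACA 16012 midchord value quoted in the text
sigma_ir_observed = 0.80     # hypothetical observation, for illustration only

sigma_i = roughness_alone(sigma_ir_observed, CP_X_MIDCHORD)
print(f"sigma_i (roughness alone) = {sigma_i:.3f}")
print(f"round trip sigma_iR       = {body_with_roughness(sigma_i, CP_X_MIDCHORD):.3f}")
```

The factor (1 + C_px) reflects the higher local velocity at the roughness, so the flat-plate value is always smaller than the observed one when C_px is positive.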

Referring to Figs. 12 and 13, it is seen that the Phase III data
are in fair agreement with the Phase II data. (The Phase III
data are shown in both figures since the H-value was not known.)
In all cases the Phase III data are higher than the Phase II data,
and the incipient-cavitation number for the 2/1000-in. roughness
increases quite rapidly with relative height of roughness. However,
for the 2/1000-in. roughness, the boundary-layer thickness
was calculated rather than measured, and the incipient-cavitation
number was calculated from the observed data by means of
Equation (14). Thus both $\sigma_i$ and $h/\delta$ for the 2/1000-in. roughness
are derived quantities. Therefore one cannot attach any
great significance to the trends shown by the Phase III data.
The greatest significance of the Phase III results is that they do
agree fairly well with the Phase II data. Consequently, if the results
of Phase II had been used to predict the incipient-cavitation
number of the hydrofoil $\sigma_{iR}$, it is evident that the prediction
would have been fairly good.

In Figs. 10 through 13, the data for Phases I and II overlap for
the 1/16-in. elements. For the circular-arc elements it is seen that
the Phase II data are from 5 to 10 per cent lower than the Phase
I data in the region of overlap. In contrast to this, the Phase II
data for the triangles, Figs. 12 and 13, are in fair agreement with
the Phase I data in the region of overlap. Apparently, differences
in the two water tunnels employed in Phases I and II caused these
discrepancies.


Application of Results 


The results presented in the preceding section can be used to 
estimate the effect of an isolated surface irregularity on the in- 
cipient-cavitation number of a parent body. For example, as- 
sume that an isolated surface irregularity is in the low-pressure 
region of a hydrofoil. In order to predict the incipient-cavitation 
number of the parent body, four factors must be known: (a) The 
height of roughness; (b) the boundary-layer thickness in the 
vicinity of the roughness; (c) the pressure distribution on the 
body in the vicinity of the roughness; and (d) the incipient-cavita- 
tion number of the roughness when considered alone. 

The minimum-pressure coefficient for a parent body with roughness,
$C_{p\min R}$, is

$$C_{p\min R} = \frac{P_\infty - P_{\min R}}{\tfrac{1}{2}\rho V_\infty^2} \tag{15}$$

where $P_{\min R}$ is the minimum pressure produced by the irregularity,
and $P_\infty$ and $V_\infty$ are the free-stream static pressure and
velocity, respectively, of the parent body. Equation (15) also
may be written as

$$C_{p\min R} = \frac{P_\infty - P}{\tfrac{1}{2}\rho V_\infty^2} + \frac{P - P_{\min R}}{\tfrac{1}{2}\rho V^2}\left(\frac{V}{V_\infty}\right)^2 \tag{16}$$

where P and V are the static pressure and velocity in the vicinity
of the roughness, respectively. If $C_{px}$ is the pressure coefficient for
the smooth body corresponding to the location of the roughness
along the body x, then

$$C_{px} = \frac{P_\infty - P}{\tfrac{1}{2}\rho V_\infty^2} \tag{17}$$

and

$$\left(\frac{V}{V_\infty}\right)^2 = 1 + C_{px} \tag{18}$$

provided the roughness is sufficiently small. If $C_{p\min}$ is the
minimum-pressure coefficient of the roughness when considered
alone, namely, when the roughness is on a flat plate, then

$$C_{p\min} = \frac{P - P_{\min R}}{\tfrac{1}{2}\rho V^2} \tag{19}$$

Substituting Equations (17), (18), and (19) in Equation (16)
results in

$$C_{p\min R} = C_{px} + (1 + C_{px})\,C_{p\min} \tag{20}$$

Assuming that $P_{\min R} = P_v$ at the state of incipient cavitation,
then

$$\sigma_{iR} = C_{px} + (1 + C_{px})\,\sigma_i \tag{21}$$

where $\sigma_{iR}$ is the incipient-cavitation number of the body with
roughness and $\sigma_i$ is the incipient-cavitation number of the roughness
considered alone. The roughness is most detrimental when
placed at the minimum-pressure point of the parent body, that is,
when $C_{px} = C_{p\min s}$, where $C_{p\min s}$ is the minimum-pressure
coefficient of the smooth body. For this case

$$\sigma_{iR} = C_{p\min s} + (1 + C_{p\min s})\,\sigma_i \tag{22}$$

Equations (20) through (22) have been derived previously, for
example, in reference [3].

One can estimate the boundary-layer thickness for a turbulent
boundary layer from the equation

$$\delta = \frac{0.37\,x}{R_x^{1/5}} \tag{23}$$

where $R_x = V_\infty x/\nu$ is the Reynolds number based on the distance x.
Equation (23) only applies to two-dimensional flow. In general,
for a given distance x the boundary-layer thickness on a body of
revolution is smaller than that for a two-dimensional body. Also
Equation (23) is only strictly applicable to a smooth flat plate
where the flow is turbulent from the leading edge. More refined
procedures for calculating $\delta$ are given in reference [11].

Since a triangular irregularity simulates the separating flow
typical of actual irregularities, it will be employed to illustrate the
effect on the incipient-cavitation number of a body due to the
presence of a surface irregularity. It is assumed that a surface
irregularity is located at the minimum-pressure point of a hydrofoil
for which $V_\infty$ = 50 fps. The minimum-pressure coefficient,
$C_{p\min s}$, is assumed to be 0.5 and the point of minimum pressure
is 2 in. from the leading edge of the body. From Equation (23),
the boundary-layer thickness is 0.048 in. for a water temperature
of 75 F. The calculations are summarized in Table 2 for various
heights of roughness. In order to obtain conservative estimates
the uppermost curve in Fig. 12 was used to obtain the incipient-cavitation
number of the roughness considered alone.

Fig. 14 Incipient-cavitation number as a function of H

Table 2 Tabulation of sample calculations for $C_{p\min s}$ = 0.5, $\delta$ = 0.048 in.

h, in.    $h/\delta$    $\sigma_i$    $\sigma_{iR}$    Per cent increase in $\sigma_{is}$
0.001     0.021     0.34     1.01     102
0.002     0.042     0.45     1.18     135
0.004     0.083     0.67     1.50     201
0.008     0.167     0.90     1.85     270

In the right-hand column of Table 2 is shown the percentage increase
in the incipient-cavitation number due to the irregularity.
The incipient-cavitation number of a smooth body $\sigma_{is}$ is approximately
equal to $C_{p\min s}$. Then from Equation (22) the
percentage increase in the incipient-cavitation number due to the
roughness is equal to $100\,(\sigma_{iR} - C_{p\min s})$ divided by $C_{p\min s}$.

Referring to the case where h = 0.001 in. in Table 2, it is seen 
that the incipient-cavitation number has been increased by 102 
per cent. It is evident from the calculations shown in Table 2 
that a small roughness can cause a very significant increase in the 
incipient-cavitation number. 
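The sample calculation can be retraced with a short script. This is a sketch under two stated assumptions: that Equation (23) is the usual smooth flat-plate relation δ = 0.37 x / R_x^(1/5), which reproduces the 0.048 in. quoted above, and that the kinematic viscosity of water near 75 F is about 1.0e-5 ft²/sec (a rounded value not given in the text). The roughness-alone cavitation numbers are those listed in Table 2.

```python
def turbulent_delta_in(x_in, v_fps, nu_ft2_s):
    """Assumed form of Equation (23): delta = 0.37 x / Re_x**(1/5), in inches."""
    re_x = v_fps * (x_in / 12.0) / nu_ft2_s
    return 0.37 * x_in / re_x ** 0.2


def sigma_ir(cp_min_s, sigma_i):
    """Equation (22): roughness at the minimum-pressure point of the body."""
    return cp_min_s + (1.0 + cp_min_s) * sigma_i


def percent_increase(cp_min_s, sigma_i):
    """Increase over the smooth body, whose cavitation number is about cp_min_s."""
    return 100.0 * (sigma_ir(cp_min_s, sigma_i) - cp_min_s) / cp_min_s


NU_WATER_75F = 1.0e-5   # ft^2/sec, assumed round value
CP_MIN_S = 0.5
DELTA_TABLE = 0.048     # in., thickness used in Table 2

delta = turbulent_delta_in(2.0, 50.0, NU_WATER_75F)
print(f"delta from assumed Eq. (23): {delta:.4f} in.")

# (h in inches, sigma_i of the roughness alone) as listed in Table 2
rows = [(0.001, 0.34), (0.002, 0.45), (0.004, 0.67), (0.008, 0.90)]
for h, s in rows:
    print(f"h = {h:.3f}  h/delta = {h / DELTA_TABLE:.3f}  "
          f"sigma_iR = {sigma_ir(CP_MIN_S, s):.3f}  "
          f"increase = {percent_increase(CP_MIN_S, s):.1f} per cent")
```

Small differences in the last digit relative to Table 2 come from rounding of the intermediate values in the tabulation.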

In some cases, one may wish to estimate $\sigma_i$ for roughness elements
different from those considered in this investigation. A
study of approximate methods indicated that one could estimate
$\sigma_i$ from the equation

(24)

where $\sigma_{i\infty}$ is the incipient-cavitation number of a roughness element
when in a uniform stream, that is, for $h/\delta = \infty$. Equation
(24) was applied to the case of the triangular elements and was
found to be in fair agreement with experiment. In this analysis
$\sigma_{i\infty}$ was obtained from the data for sharp-edged disks given in
Fig. 8 of reference [17], where the incipient-cavitation number is
shown as a function of Reynolds number. It was assumed that
the height of the triangle corresponded to the radius of the disk
and the Reynolds number of the disk, based on the radius, was
then assumed to be equal to the roughness Reynolds number for
the triangles.
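One way to read this approximate method, in line with the statement in the Conclusions that the velocity at the height of roughness characterizes the flow, is sketched below. It is an assumption and not a restatement of Equation (24): the uniform-stream cavitation number is scaled by the square of the local velocity ratio, with the velocity taken from a power-law profile u/V = (Y/δ)^(1/m). The uniform-stream value 1.0 used in the example is hypothetical.

```python
def sigma_i_estimate(sigma_i_inf, h_over_delta, m):
    """Scale the uniform-stream value by the dynamic pressure at roughness height.

    Assumes u_h/V = (h/delta)**(1/m) inside the layer and u_h/V = 1 outside it.
    """
    u_h_over_v = min(h_over_delta, 1.0) ** (1.0 / m)
    return sigma_i_inf * u_h_over_v ** 2


SIGMA_I_INF = 1.0   # hypothetical uniform-stream value, for illustration only
for ratio in (1.0 / 70.0, 0.05, 0.2, 1.0):
    print(f"h/delta = {ratio:.4f} -> sigma_i = {sigma_i_estimate(SIGMA_I_INF, ratio, 7.0):.3f}")
```

Under these assumptions the estimate recovers the uniform-stream value once the roughness reaches the edge of the boundary layer and falls off slowly as the roughness is buried deeper in it.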


Conclusions 
The major conclusions of this investigation are as follows: 


1 The incipient-cavitation number of an isolated surface ir- 
regularity in a turbulent boundary layer is dependent upon the 
relative height of roughness, the boundary-layer shape parameter, 
the velocity, and other variables as yet unknown. 

2 For given values of velocity and boundary-layer shape 
parameter, the incipient-cavitation number increases with rela- 
tive height of roughness. Furthermore, the relative height of 
roughness appears to be the most significant variable for describ- 
ing the incipient-cavitation number. 

3 For given values of velocity and relative height of rough- 
ness, the incipient-cavitation number increases with a decrease 
in the boundary-layer shape parameter. 

4 In most instances, the incipient-cavitation number in- 
creases with velocity. This type of scale effect also prevails for 
bodies in a uniform stream. 

5 The variations in the incipient-cavitation number with
velocity are due to scale effects on the minimum and/or the
cavitation pressure. For the 1/4-in. circular-arc irregularity, the
incipient-cavitation number and measured minimum-pressure coefficient
are in fair agreement. Hence in this instance there is
no scale effect on the cavitation pressure.

6 The per cent change of the incipient-cavitation number with 
velocity was least for the largest irregularities in most cases. 
This result suggests a size effect. However, for a given value of 
the shape parameter, the relative height of roughness was changed 
by varying the height of roughness. Thus it is not possible to 
separate the effects of size entirely from those due to variations in 
the relative height of roughness. 

7 If the incipient-cavitation number is dependent upon size 
as well as velocity, then it is probable that this is a Reynolds- 
number dependency. 

8 For the circular arcs, the minimum-pressure coefficient, de- 
termined by the frozen-streamline theory, was about 25 per cent 
greater than the measured minimum-pressure coefficient and 
was as much as 40 per cent greater than the incipient-cavitation 
number. 

9 When the boundary-layer thickness is 70 times greater than 
the height of roughness, the incipient-cavitation number for a 
triangular irregularity in a turbulent boundary layer can be as 
great as 0.3. Thus for comparable conditions the incipient- 
cavitation number of any smooth streamlined body of practical 
interest would be significantly increased by an irregularity lo- 
cated in the region of minimum pressure. 

10 For Reynolds numbers less than about 100, the minimum- 
pressure coefficients for cylinders and spheres in a uniform stream 
increase with a decrease in Reynolds number. Furthermore, the 
minimum-pressure coefficients attain very large values at low 
Reynolds numbers. These results suggest that even when the 
relative height of roughness is small, the roughness Reynolds 
number may be such as to cause significant pressure reductions
in the vicinity of a roughness when imbedded in a laminar 
boundary layer or a laminar sublayer. 

11 A study of approximate methods indicates that one can 
estimate the incipient-cavitation number of an isolated surface 
irregularity by assuming that the velocity at the height of rough- 
ness characterizes the flow in the boundary layer. 
APPENDIX
Derivation of Frozen Streamline Theory 


In the rotational field, u, p, and r will denote velocity in the X-direction,
pressure, and radius of curvature, respectively. In the
potential field, the corresponding symbols are V, P, and R.
The subscript infinity ($\infty$) denotes conditions at $X = -\infty$. The
subscript zero (0) denotes conditions at X = 0. Thus $Y_\infty$ is the
distance from the X-axis at $X = -\infty$, and $Y_0$ is the distance
from the X-axis at X = 0.

Consider the flow between two neighboring streamlines from
$X = -\infty$ to X = 0. At X = 0 and $X = -\infty$ the velocity is
zero in the Y-direction. Thus from the continuity condition, one
finds that for the rotational flow

$$\frac{u_0}{u_\infty} = \frac{dY_\infty}{dY_0} \tag{25}$$

and for the potential flow

$$\frac{V_0}{V_\infty} = \frac{dY_\infty}{dY_0} \tag{26}$$

where dY is the distance between the streamlines at the stations
indicated.

Equating Equations (25) and (26) and rearranging results in

$$\frac{u_0}{V_0} = \frac{u_\infty}{V_\infty} \tag{27}$$
The velocity in the Y-direction is zero at X = 0. Thus, from
reference [19], the momentum equation¹⁰ for the rotational flow
is

$$\frac{dp_0}{dr_0} = \frac{\rho u_0^2}{r_0} \tag{28}$$

where $\rho$ is the mass density. Similarly for the potential flow

$$\frac{dP_0}{dR_0} = \frac{\rho V_0^2}{R_0} \tag{29}$$

Since it was assumed that the form of a streamline is the same in
both fields, it follows that $r_0$ and $R_0$ are equal. Therefore from
Equations (28) and (29) one finds that

$$\frac{dp_0}{dP_0} = \left(\frac{u_0}{V_0}\right)^2 \tag{30}$$

Substituting Equation (27) in Equation (30) yields

$$dp_0 = \left(\frac{u_\infty}{V_\infty}\right)^2 dP_0 \tag{31}$$

Integration of Equation (31) from $Y_0 = h$ to $Y_0 = \delta_0$ results in

$$p_{\delta_0} - p_h = \int_h^{\delta_0} \left(\frac{u_\infty}{V_\infty}\right)^2 dP_0 \tag{32}$$

where $p_{\delta_0}$ is the pressure at $Y_0 = \delta_0$ and $p_h$ the pressure at $Y_0 = h$.
But $p_h$ is the minimum pressure $p_{\min}$ since the maximum velocity
occurs at $Y_0 = h$. Furthermore, at the edge of the boundary
layer the pressure in the rotational field is equal to that in the
irrotational field, i.e., $p_{\delta_0} = P_{\delta_0}$. Employing these substitutions
in Equation (32) gives

$$P_{\delta_0} - p_{\min} = \int_h^{\delta_0} \left(\frac{u_\infty}{V_\infty}\right)^2 dP_0 \tag{33}$$

¹⁰ The radius of curvature is positive outward from the center of
curvature.
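Bernoulli's equation applies throughout the potential field.
At X = 0, Bernoulli's equation is

$$\tfrac{1}{2}\rho V_0^2 + P_0 = \text{const} \tag{34}$$

The differential of the pressure is

$$dP_0 = -\tfrac{1}{2}\rho\, dV_0^2 \tag{35}$$

Substitution of Equation (35) in Equation (33) for $dP_0$ results in

$$P_{\delta_0} - p_{\min} = -\frac{\rho}{2}\int_h^{\delta_0} \left(\frac{u_\infty}{V_\infty}\right)^2 dV_0^2 \tag{36}$$

The points $Y_\infty = \delta_\infty$ and $Y_0 = \delta_0$ are on the streamline at the
edge of the rotational field. Writing Bernoulli's equation between
these points gives

$$P_{\delta_0} = P_\infty + \tfrac{1}{2}\rho V_\infty^2 - \tfrac{1}{2}\rho V_{\delta_0}^2 \tag{37}$$

Employing Equation (37) in Equation (36) for $P_{\delta_0}$ and rearranging
results in

$$P_\infty - p_{\min} = \tfrac{1}{2}\rho V_{\delta_0}^2 - \tfrac{1}{2}\rho V_\infty^2 - \frac{\rho}{2}\int_h^{\delta_0} \left(\frac{u_\infty}{V_\infty}\right)^2 dV_0^2 \tag{38}$$

Dividing Equation (38) by $\tfrac{1}{2}\rho V_\infty^2$ yields

$$C_{p\min} = \int_{Q_\delta}^{Q_h} \left(\frac{u_\infty}{V_\infty}\right)^2 dQ + Q_\delta - 1 \tag{39}$$

where

$$Q = \left(\frac{V_0}{V_\infty}\right)^2, \qquad Q_h = \left(\frac{V_h}{V_\infty}\right)^2, \qquad Q_\delta = \left(\frac{V_{\delta_0}}{V_\infty}\right)^2 \tag{40}$$

The upstream velocity profile $u_\infty/V_\infty$ is arbitrary in Equation
(39). However, only the turbulent boundary-layer profiles given
by Equation (6) were used here. Substitution of Equation (6) in
Equation (39) gives

$$C_{p\min} = \int_{Q_\delta}^{Q_h} \left(\frac{Y_\infty}{\delta_\infty}\right)^{2/m} dQ + Q_\delta - 1 \tag{41}$$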
Since Equation (41) is expressed in terms of variables defined for
the potential field it can be solved by the well-known relations for
the Karman-Trefftz transformation [20]. This transformation is
given by

(42)

where

$$Z = X + iY \tag{43}$$

and

$$\zeta = \xi + i\eta \tag{44}$$


It can be shown that a circle of radius a in the $\zeta$-plane is transformed
into a circular-arc profile in the Z-plane by means of Equation
(42).

Equation (41) was integrated graphically for the circular-arc
profile having a thickness ratio $\lambda$ of 0.175. In order to carry
out this integration, Q was initially determined in the $\zeta$-plane for
the circular cylinder. The complex potential W for a circular
cylinder is


(45) 


Letting $Q'$ denote Q in the $\zeta$-plane and arbitrarily setting a = 1,
then from Equation (45) it is found that


1 

No 
where $\eta$ and $Y_\infty$ are related by the equation


3 + (47) 

and $\eta$ is the distance from the $\xi$-axis at $\xi = 0$. Equation (47)
was obtained from the expression for the stream function; namely,
the imaginary part of Equation (45).

The velocities in the Z- and $\zeta$-planes are related by the equation

$$\frac{dW}{dZ} = \frac{dW}{d\zeta}\,\frac{d\zeta}{dZ} \tag{48}$$

Employing Equations (42) and (45) in Equation (48) results in


(> + 4 (#) 


where $\eta$ and $Y_0$ are related by the equation


Yo = n tan E arctan % — > (n — »| (50) 


Knowing $Y_\infty$, the variable $\eta$ is given by Equation (47), and
then $Q'$ and $Y_0$ can be determined from Equation (46) and Equation
(50), respectively. Following this series of calculations, Q
can be obtained from Equation (49). If one desires the velocity
profile over the circular arc, then it is evident from Equations
(27) and (40) that

$$\frac{u_0}{V_\infty} = Q^{1/2}\,\frac{u_\infty}{V_\infty} \tag{51}$$

By employing a variable transformation for the case $\lambda$ = 0.5,
that is, a semicircular profile, it was possible to obtain analytical
solutions of Equation (41) for m = 1, 2, and 4. In addition, an
approximate expression for $C_{p\min}$ was determined for $\lambda$ < 0.5,
m > 2, and $h/\delta_\infty$ < 1. These derivations are given in reference
[5].
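For the semicircular profile the integral of Equation (41) can also be evaluated numerically, which gives a feel for the magnitudes involved. The sketch below is an illustration under stated assumptions rather than the graphical or analytical procedure of the paper: Q is taken as the squared velocity ratio (V_0/V_∞)², the upstream profile is the power law (Y_∞/δ_∞)^(1/m), and the potential flow over a semicircular bump of height h is taken to be that past a circular cylinder, so that above the crest V_0/V_∞ = 1 + h²/Y_0² and Y_∞ = Y_0 − h²/Y_0. The function name and step count are arbitrary.

```python
import math


def cpmin_semicircle(h_over_delta, m, steps=20000):
    """Frozen-streamline C_pmin for a semicircular bump; lengths in units of delta_inf."""
    h = h_over_delta

    def y0(y_inf):
        # Height above the wall at X = 0 of the streamline that starts at y_inf.
        return 0.5 * (y_inf + math.sqrt(y_inf * y_inf + 4.0 * h * h))

    def q(y_inf):
        # Q = (V_0 / V_inf)**2 on that streamline, from cylinder potential flow.
        return (1.0 + (h / y0(y_inf)) ** 2) ** 2

    def profile_sq(y_inf):
        # (u_inf / V_inf)**2 for the assumed power-law profile.
        return y_inf ** (2.0 / m)

    integral = 0.0
    for k in range(steps):
        ya = k / steps
        yb = (k + 1) / steps
        # Trapezoid rule in Q; Q falls as Y grows, so q(ya) - q(yb) is positive.
        integral += 0.5 * (profile_sq(ya) + profile_sq(yb)) * (q(ya) - q(yb))
    return integral + q(1.0) - 1.0


for ratio in (0.05, 0.1, 0.2):
    print(f"h/delta = {ratio:.2f}  C_pmin = {cpmin_semicircle(ratio, 7.0):.3f}")
```

As a check on the sketch, letting m become very large flattens the upstream profile toward a uniform stream, and the result approaches 3, the potential-flow value for a circular cylinder in the sign convention used here.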
Indentation of Metals by Cavitation


The first stage in cavitation damage, indentation where cavitation bubbles collapse near
the metal surface, was observed and measured by Knapp [1].¹ The author suggested
that the high temperature of indentation points, caused by the absorption of work in
their formation, should create conditions favorable to energetic local chemical action 
between water and metal. The brittle oxide films thus produced would be ruptured by 
later blows, resulting in erosion. Photomicrographs taken in monochrome and color 
show the form and texture of indentations with associated corrosive oxidation. Parallel 
experiments were made with toluene, whose inertness permitted the elimination of chemical
factors. The special features of the microscope technique employed to get the photographs
are explained.


An attempt has been made to see how cavitation
damage on a metal surface actually develops, that is, to get a
visual record of the stages of alteration starting with a nearly perfect
surface and observing first the individual indentations, and
subsequently their overlap and the later consequences.

Indentations in aluminum, produced by cavitation in water and
toluene, have been examined and photographed. With the same
amplitude of vibration, which is a measure of cavitation severity
for a given liquid, individual indentations turn out to be generally
similar in the two liquids although perhaps somewhat larger in
toluene, but the number of indentations and the extent of over-all
damage is much greater with water.

In those areas where indentation has occurred repeatedly, rupture
and loss of metal from the surface can be observed. There
are indications that this takes place, first, by the ejection of metal from
repeatedly indented areas because of one later severe blow and,
secondly, by the breakaway of fragments of metal from crater
rims thrown up around indentations. Chemical reaction between
water and aluminum probably takes place at rupture points
but the evidence is insufficient to show whether this is a prime
factor in rate of loss of metal.

Indentations in harder metals have been observed, most successfully
with cold-rolled pure iron in distilled water. Interesting
differences are evident and it is apparent that chemical action
has occurred between the water and the iron at the indentations,
and also that preferential corrosion has taken place at many small
points in the metal, which may be strain points.

The cavitation damage was produced by a magnetostriction
apparatus. The experimental procedure included examination
of the damaged specimens using the special technique of polarized
light combined with a "whole-wave" or "sensitive-tint" plate,
whereby changes in contour and surface texture are enhanced by
variations in color and tint instead of being rendered only in the
normal light and shade of black and white. Use was made of
photomicrographs taken on color film as well as the black and
white pictures reproduced here.


General


Knapp [1]¹ reported the observation and measurement of indentations
in annealed aluminum at points where cavitation
bubbles collapsed near the metal surface. This observation imparted
realism to the first stage of what previously had been a
purely theoretical damage sequence. The ultimate stage, severe
erosion due to cavitation, had often been observed, but this first
step in the process had not. The thought was not new but the
practical demonstration was extremely stimulating to further
conjecture.

¹ Numbers in brackets designate References at end of paper.

Contributed by the Subcommittee on Cavitation of the Hydraulic
Division and presented at the Hydraulic Conference, Ann Arbor,
Mich., April 13-15, 1959, of The American Society of Mechanical
Engineers.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions by their authors and not those
of the Society. Manuscript received at ASME Headquarters, December
31, 1958. Paper No. 59—Hyd-15.

It happened that at about the same time [2] the idea was put 
forward that the rise in temperature which must occur at 
the instant of indentation, from the absorption of work by the 
metal, would give rise to energetic, though transient, chemical 
reaction between the water and the metal. A variety of secondary 
consequences could ensue when later blows fell on the same area, 
among them the rupture and detachment of brittle oxide films, 
and the exposure of new metal to repeat the process. This would 
explain one form of erosion, possibly the dominant form in cer- 
tain cases. In parallel, it was postulated, loss of metal into solu- 
tion would take place as a result of electrolysis between strain 
points created by each deformation (which would be anodic), and 
the body of the metal. This effect would be significant only if 
the liquid were ionized. 

A measure of experimental support for this hypothesis was 
subsequently obtained. In particular it was established by 
chemical assay that particles eroded from ferrous specimens con- 
tain a substantial proportion of oxide, and the local electrolytic 
action referred to has also been firmly established [2]; but, at- 
tempts to measure the transient temperatures attained at inden- 
tation points met with only indifferent success. 

The thought then developed that if the appearance of actual 
indentations caused by cavitation could be examined this might 
stimulate ideas regarding other possible methods of investigating 
indentation temperatures. Accordingly, it was decided to pursue 
Knapp’s observations of visible indentations in aluminum. This 
would permit, in addition to the opportunity for examination of 
individual indentations, visual observation of what happens when 
the indentations overlap and when several indentations fall suc- 
cessively on one spot. Whatever it is that occurs, in conjunction 
with the plastic deformation which takes place in the process of 
indentation, to cause detachment of material from the surface 
might then become visually evident. 

It was decided to make a comparison between toluene and 
water at the same time. It had been suggested in 1954 [3] that, 
for cavitation-testing purposes, toluene might be considered as a 
chemically inert counterpart of water, having nearly the same 
vapor-pressure curve, with other cavitation characteristics not 
too dissimilar, but possessing no reactivity with metals. If the 
argument is valid, the fact that the rate of erosion with toluene 
is spectacularly reduced, compared with water, would indicate 
that chemical combination of the water with the metal is a 
dominant contributor to erosion in water. This was, indeed, 
one of the arguments leading up to the conjectural mechanism
mentioned earlier. The alternative that, for hydrodynamic 
reasons, toluene does not produce an equivalent blow when cavita- 
tion bubbles formed in it are collapsed, obviously could not be 
disregarded and experimental confirmation was necessary. The 
indentation of aluminum seemed a good starting point for this 
inquiry. 

Depending on the results obtained with aluminum, harder 
metals would be investigated following a similar pattern, with the 
aim of discerning, by visual means, something more of the 
mechanism by which the collapse of cavitation bubbles leads to 
severe erosion of many metals. 


Experimental 


Cavitation Apparatus, Specimens, and Liquids. Cavitation was 
produced by a nickel magnetostriction vibrator [2] operated at 
8000 cps. The amplitude of vibration was 0.03 mm giving rise to 
relatively severe cavitation. 

The specimens were flat faced, 1 cm in diam, of the type asso- 
ciated with this kind of apparatus. The aluminum specimens 
were made by soldering a disk cut from 22 swg 99.99 per cent 
purity aluminum sheet to the working face of a standard specimen 
body at 250 deg C. The face was then electropolished in perchloric/
acetic acid solution. The hardness was about 27-30 Vickers
diamond-pyramid hardness. The iron specimens² were machined
from cold-rolled material of VDH 165, mechanically polished to a 
good microfinish by normal methods. 

The water and toluene used with the aluminum specimens were 
rendered oxygen free by boiling for 30 min in a nitrogen at- 
mosphere, and were manipulated in nitrogen. The liquid used 
with the cold-rolled-iron specimens was ordinary distilled air- 
saturated water. 

Microscopy. A brief explanation is given of the special features 
of the microscope technique employed. 

A standard polarizing microscope was fitted with a “whole-
wave” or “sensitive-tint” plate [4], placed immediately behind
the polarizer in the vertical illuminator. This component, ordi- 
narily used to provide a sensitive means of detecting double re- 
fraction or anisotropy in specimens through which light is trans- 
mitted, also shows, by alterations of tint or color in the field, 
any changes in polarization introduced by a specimen viewed 
under reflected light. 

Generally speaking cubic metals—for example Fe, Cu, Al—do 
not produce any effect on polarized light except in the sense that 
oblique reflection from any surface gives rise to elliptical polariza- 
tion. This is to say that they are optically isotropic. But, it is 
known that they can be made to react by deep etching, for ex- 
ample [5], or by the deposition of some film on the surface. 

The effect of apparent anisotropy produced by deep etch pits is 
attributed to double oblique reflection from abutting crystal faces 
perpendicular to each other, exposed by the etching. Each 
oblique reflection affects the polarization. The same effect would 
be produced by rupture at natural cleavage planes, which 
with a cubic metal would also expose abutting perpendicular faces. 
Therefore, examination of a cavitation-damaged surface under 
crossed polars (without sensitive-tint plate) should show fracture 
areas bright, upon a dark ground. The two pairs of photomicro- 
graphs at Fig. 1 are included to show this effect. 

It will be observed that very little new information is dis- 
closed, the appearance under crossed polars being almost a nega- 
tive of the appearance under normal illumination. Inclusion of 
the sensitive-tint plate modifies this situation by disclosing varia- 
tions in color of the viewed image for every variation in degree of 
change in polarization. Slight changes in tint are associated with 

² Cold-rolled to 50 per cent reduction in thickness: C 0.04; Mn
0.08; Si 0.05; S&P 0.02. 
the small degree of ellipticity caused by a gentle variation in slope
but a marked change of color, even to the complementary shade, 
is recorded for the large degree of ellipticity introduced by double- 
oblique reflection at perpendicular crystal faces. 

In addition, any film of oxide formed on the surface may be 
capable of affecting the color of the viewed image if the polariza- 
tion is affected either by reflection at the oxide surface or by 
passage through it and reflection on the metal surface below. In 
the case of iron, for example, Fe₂O₃ is known to be anisotropic,
that is, it affects the direction of polarization. FeO and Fe₃O₄ are
considered to be isotropic, that is, noneffective. 

The manipulation of the optical and photographic apparatus is 
fairly normal but there are certain points needing special atten- 
tion. 

The amount of light reaching the eye, or camera, is greatly 
reduced by the polarizing train and an abnormally high level of 
illumination is essential to avoid unduly long exposure times, 
during which a shift of focus can occur. Special care must be 
taken to achieve complete heat filtration which, if neglected, 
likewise causes a shift of focus. For the present experiments a 35- 
mm slide projector was substituted for the ordinary microscope 
lamp and a 100-watt-projector lamp, overrun at up to 20 per cent 
overvoltage, was employed. The whole of the issuing light was 
focused into the vertical illuminator. Two heat filters were used 
in the train, ahead of the polarizer. 

The second special point relates to focusing. On flat, un- 
damaged areas there is, of course, no difficulty. But if the speci- 
men is traversed at this same focus setting it is unlikely that any 
indentations will be discerned. Wherever damage is seen the focus 
must be very carefully shifted toward, and away from the sur- 
face, and the eye sometimes moved askew. The magnification 
should also be varied, preferably by changing the objective. 
Gradually the observer will become practiced in recognizing in- 
dentations. Even so, as the eye becomes tired, or the focus is 
shifted ever so slightly, the appearance of indentations can be 
lost. Eventually the difficulty disappears, but for successful 
pictures the focus must be correct to within half a micron. 

The third point concerns the polarizing train and whole-wave 
plate. A start may be made by getting the analyzer correctly 
adjusted in relation to the reflecting glass of the vertical il- 
luminator; that is, giving minimum illumination when the 
polarizer and the whole-wave plate are out of the train. 
The polarizer may now be inserted, in the crossed position, giving a 
dark field when viewing a clean flat isotropic surface. The whole- 
wave plate is now introduced, with its axis at 45 deg to the polar- 
izer and analyzer. The field will now appear cerise in color. 
With this setting the indentations will be much more easily dis- 
cerned than with the ordinary microscope illumination. The 
operator is now at liberty to adjust as he finds it convenient. The 
procedure used for these experiments was to shift polarizer and 
whole-wave plate together, leaving the analyzer undisturbed. 
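
The cerise field can be made plausible with a small calculation. The sketch below assumes an ideal retarder between ideal crossed polarizers, with its axis at 45 deg, for which the transmitted fraction at wavelength λ is sin²(πΓ/λ). The retardation Γ = 550 nm is an assumed typical value for a whole-wave plate; the paper does not state the value used, and the function name, the sample wavelengths, and the 50 nm of extra retardation are illustrative only.

```python
import math

# Assumed retardation of the whole-wave (sensitive-tint) plate, in nm.
# This is a typical textbook value, not a figure taken from the paper.
PLATE_RETARDATION_NM = 550.0

def crossed_polar_transmission(wavelength_nm, retardation_nm=PLATE_RETARDATION_NM):
    """Fraction of light passed by ideal crossed polars with a retarder at 45 deg."""
    return math.sin(math.pi * retardation_nm / wavelength_nm) ** 2

# Sample one blue, one green, and one red wavelength.
for name, wavelength in (("blue", 450.0), ("green", 550.0), ("red", 650.0)):
    plain = crossed_polar_transmission(wavelength)
    # A small extra retardation, standing in for a surface feature (illustrative).
    shifted = crossed_polar_transmission(wavelength, PLATE_RETARDATION_NM + 50.0)
    print(f"{name:5s} {wavelength:.0f} nm: plate only {plain:.2f}, plate + 50 nm {shifted:.2f}")
```

With the plate alone, green is extinguished while blue and red pass, which mixes to a purplish-red field. A small added retardation moves the extinguished band and so changes the tint markedly, which is why color changes within the field are a sensitive indicator.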

Color Variations. It is important to understand that the basic 
color of the field with a flat isotropic surface is a function of the 
relative positions of the components in the polarizing train. The 
setting used is a matter of convenience and individual preference, 
and the basic color rendering is purely dependent on the setting. 
But color changes within the field, when indented or damaged 
areas are being viewed, are caused by the damage. Therefore it 
is color changes which must be studied. Different pictures of the 
same area may be basically green, or yellow, or pink, simply due 
to variations in adjustment; but variations from green to orange, 
or cerise to green, over the field of view, are due to significant 
changes in the surface under examination. 


The results of the investigation take the form of photomicro- 
graphs which are reproduced in Figs. 1-9. 
Fig. 1 illustrates the appearance of a cavitation-damaged
specimen under ordinary polarized light. This is desirable as 
a reminder of the special technique adopted for most of 
the other figures, for which a whole-wave plate was included 
in the optical train, with the polarizers. Fig. 2 is included pri- 
marily to illustrate the difference in the extent of damage 
caused, with aluminum, by water and toluene, respectively, at the 
same vibration amplitude. The two photographs were taken 
without polars or whole-wave plate, using a X1.7 objective and 
X8 eyepiece. Illumination of the specimen is difficult at such low
magnification and the main character is the general appearance 
of the damage. 


(a) Slightly damaged


(c) Severely damaged surface under normal microscope illumination
Fig. 1 Annealed high purity iron with polarized light (X320)
Fig. 1 Annealed high-purity iron. VDH 80. Mag. X320. Two
areas of a cavitation-damaged annealed iron specimen showing the 
appearance with normal microscope illumination, paired with 
the appearance when viewed through crossed polars. Not much 
additional information is disclosed by the use of crossed polars. 
But it is possible to distinguish, for example, between the dark 
and light pattern attributable to small, close indentations visible 
on a few crystals under normal illumination, which do not ap- 
pear bright under crossed polars, and dark areas and lines which 
represent rupture areas and cracks, which do appear bright under 
crossed polars. As always, the polars can only function with full 
effectiveness over the central areas of the pictures, running N-S. 


(d) Same area as (c) under crossed polars 
(a) Surface of pure aluminum after 60 sec cavitation in water


(b) Surface of pure aluminum after 120 sec cavitation in toluene 


Fig. 2 Pure aluminum (X14) 


Fig. 2 Soft-temper pure aluminum. VDH 27-30. Mag. X14. 
Near-central, damaged areas of electropolished pure-aluminum 
specimens, after being subjected to relatively severe cavitation for 
60 sec in water and 120 sec in toluene, respectively, both liquids 
oxygen-free. Furrows are due to imperfect electropolishing con- 
ditions and were retained to give greater light and shade in the 
photographs, whereby the indentations are more easily observed. 
The water has produced substantially more damage than the 
toluene, at the same vibration amplitude, in half the time. It is 
known [6] that abrasion of aluminum immersed in water pro- 
duces hydrogen as long as abrasion continues, but hydrogen 
evolution stops quickly when mechanical disruption is stopped 


(a) Multiple indentations in aluminum produced by cavitation in water 
because of the formation of a new protective film. This effect will
give rise to a difference between water and toluene in cases where 
rupture of the aluminum surface once occurs. However, present 
evidence is not sufficient to show whether this is the significant 
effect in producing the greater damage with water than with 
toluene which takes place at the high cavitation severity used in 
these experiments. 

Fig. 3 Soft-temper pure aluminum. VDH 27-30. Mag. X320.
Multiple indentations without appreciable rupture, Fig. 3(a) in 
water and Fig. 3(b) in toluene at higher magnification than Fig. 2. 
The shape of the indentations is not regularly spherical which 
may be attributable to departure of the collapsing bubble from 


(b) Multiple indentations in aluminum produced by cavitation in toluene
Fig. 3 Pure aluminum (X 320) 
(a) Severe rupture damage caused by cavitation in toluene


(b) Transition from indentations to early rupture damage, in toluene 


Fig. 4 Pure aluminum (X80) 


the spherical shape, photographed by Ellis [7], as well as second- 
ary deformation of early indentations by later ones. Compari- 
son of these two pictures, taken and reproduced at the same 
magnification, shows a general increase in size for indentations in 
toluene compared with water. The crack in the toluene specimen 
may have started as a scratch. The inclusion of these pictures of
indentations at higher magnifications, at this stage in the se- 
quence, will enable the reader to become familiar with the de- 
tailed appearance of multiple indentations. 

Fig. 4 Soft-temper pure aluminum. VDH 27-30. Mag. X80. 
Rupture damage adjacent to indentations, both in toluene. Fig. 
4(a) is the badly scarred area in the upper left, NW, portion of 


(a) Multiple indentations without rupture damage in toluene [for compari- 
son with (b)] 


Fig. 2(b) and Fig. 4(b) is an area to the right of the scarred area. 
The loss of metal from the scarred area puts some parts out of 
focus which defeats one of the objects of the picture, viz., inspec-
tion of details in the transition from multiple indentations to
actual rupture from the surface. The area covered by Fig. 4(b) 
suffers less from the focussing difficulty, the rupture damage being 
less advanced, but the undulations in the damaged SW portion 
prevent photomicrography of much of the detail. The indenta- 
tions adjacent to the rupture damage are well reproduced in both 
pictures. Color slides of both areas have been made and these are 
somewhat more informative. 

Fig. 5 Soft-temper pure aluminum. VDH 27-30. Mag. X200. 
(b) Multiple indentations merging into rupture damage in toluene


Fig. 5 Pure aluminum (X200)


(a) Multiple indentations without rupture damage in toluene [for comparison with (b)]

(b) Rupture damage superimposed on indentations in toluene 


Fig. 6 Pure aluminum (X320)


These photographs at higher magnification than Fig. 4 (but lower 
than Fig. 3) represent a further attempt to observe details of the 
transition (in toluene) from multiple indentations to rupture 
damage. Fig. 5(a) shows indentations alone and Fig. 5(b) shows 
an area where rupture damage is increasing toward the top right 
(NE) corner of the picture. Details of the nature of the transition 
are not well brought out, although more can be observed when 
viewed through the microscope. The main usefulness of these 
pictures is in showing the general appearance of indentations and 
the start of rupture damage at a different magnification. 

Fig. 6 Soft-temper pure aluminum. VDH 27-30. Mag. X320. 
A further comparison of indentations and rupture damage in 
toluene at higher magnification (same as Fig. 3) from which in- 
teresting information can be gleaned. Fig. 6(a) is similar in 
character to Fig. 3(b), showing essentially multiple indentations, 
in another area. But Fig. 6(b) discloses new details. If the
optical arrangement with polars and whole-wave plate is re- 
called it will be remembered that rupture points should be 
characterized by marked changes of color in the field of observa- 
tion. The base color of the field, when both 6(a) and 6(b) were 
photographed, was pinky-mauve, but in the case of 6(b) there were 
a number of small areas colored bright yellow/green. The 
“glow” areas visible in Fig. 6(b) were the ones showing these
colors. When the microscope was adjusted very gently up and
down it became evident that these glow areas coincided with
the bottom of certain indentations and the crests of some crater
peaks, indicating that at these points rupture of metal from the 
mass had occurred. A possible explanation of this is given later. 
A color slide of the area has been made. 

Figs. 7-9 show the appearance of the mechanically polished cold- 
rolled iron surface after 80-min cavitation in ordinary distilled 
water. The characteristics to which particular attention is 
drawn are the oxidation/corrosion associated specifically with 
indentations, the numerous corrosion spots, and cracks or corro- 
sion associated with crater rims. The indentations are also 
different in appearance from those seen in aluminum. Apprecia-
ble erosion of the specimen took place during the 80-min cavita-
tion, resulting in a main pit of diameter rather less than 1 mm 
surrounded by what may be described as a “coked” area,
again surrounded by a halo of temper colors. This is the normal 
appearance of iron specimens after severe cavitation for this period. 
The photographs do not show the pit, which is of no special in- 
terest in the present context. They were taken at various 
points on the coked surround. 

Fig. 7 Cold-rolled high-purity iron. VDH 165. Mag. X200.
These two photographs of the cold-rolled-iron surface are re- 
produced to compare, at moderate magnification, the appearance 
of an almost unindented area which has suffered corrosion, Fig. 
7(a), and a severely indented area adjacent to the main erosion 
pit, Fig. 7(b). The faint “orange-peel” mottle in the background
of Fig. 7(a) is recognizable, with practice, as widespread but very 
mild indentation. It is more clearly discernible in the microscope 
with slight variation of focus. However, the main characteristic
of Fig. 7(a) is the corrosion. Its mode of formation is not thought
to be necessarily significant in relation to cavitation, deriving 
more probably from local conditions of the specimen, and the 
picture is reproduced to emphasize the completely different nature 
of the damage illustrated in Fig. 7(b) where the characteristic 
appearance of multiple indentations is easily recognizable. Im-
portant differences, compared with aluminum, are apparent.
First, a sort of grain in the indentations (like the grain in wood) is 
seen running approximately NE-SW. This is presumed to be 
attributable to the direction of rolling of the metal. Further, it 
is seen that the area under consideration has suffered a number of 
blows of sufficient severity to produce large indentations, but 
many smaller indentations are superimposed on already-deformed 
areas. This effect is apparent with the aluminum but is more 
noticeable with the iron. In the NW area of the picture 
there are numerous cracks associated, as can be seen by refocus- 
ing, with the rims of craters. These may develop when crests 
coincide with natural grain boundaries. 

Fig. 8 Cold-rolled high-purity iron. VDH 165. Mag. X320.
(a) Almost unindented cold-rolled iron surface which has suffered
corrosion, in water 


(b) Severely indented area of same specimen showing different 
appearance, in water 


Fig. 7 High purity iron: cold rolled (X200)


(a) Showing appearance of indented area when properly focused, in water 


(b) Same area as (a) with focus shifted 2 microns away, in water 


Fig. 8 High purity iron: cold rolled (X320)


The purpose of this pair of photographs of an area near the SW 
corner of Fig. 7(b) (at higher magnification) is to show the al- 
teration in appearance caused by a small change in focusing. 
The point of this is that it may help to explain why multiple in- 
dentations of the nature now recorded do not seem to have been 
photographed previously. Unless the correct point on the slope 
of the indentations is found for focusing, they are fuzzy and un-
recognizable. However, the present optical arrangements are
probably the main factor in seeing the indentations clearly. For 
Fig. 8(b) the X 40 objective was moved 2 microns away from the 
specimen, compared with Fig. 8(a). The objective aperture was 
0.85 and the illumination stop was adjusted to fill two-thirds of 
the diameter of the rear-opening. The camera position was un- 
changed. 
Fig. 9 High purity iron: cold rolled (a, X320) (b, X720)


Fig. 9 Cold-rolled high purity iron. VDH 165. Mag. (a) X320; (b)
X720. Fig. 9(a) is reproduced at a magnification of X320; Fig.
9(b) at X720. The purpose of both photographs is to show the
specific association of corrosive oxidation with individual indenta-
tions. Both areas have also been photographed in color and the
effect is again well shown. In Fig. 9(b) almost every indentation
shows clear signs of corrosion and the same effect can be recog-
nized, at lower magnification, over the whole area covered by Fig.
9(a). It is hard to reach any other conclusion than that oxidation
occurs as a secondary effect of the indentation. Another feature 
of the photographs is the occurrence of many small black spots. 
They are particularly noticeable in Fig. 8(a), but are present also 
in Figs. 9(a) and 9(b). The same effect has been observed with 
annealed iron and with other cold-rolled iron specimens, after 
they have been subjected to cavitation. It is suggested that they 
represent strain points, developed by the cavitation blows, where 
corrosion occurs preferentially on account of their anodic nature 
with respect to the rest of the specimen. 


Discussion 


Comparison Between Water and Toluene. The results illustrated 
in Fig. 2 make it clear that, with the same vibration amplitude, 
the number of cavitation impacts sufficiently powerful to cause 
equivalent indentations in aluminum is several times larger for 
water than for toluene. A more complete comparison, in which 
the part played by chemical combination between water and 
newly exposed aluminum metal would need to be separated, re- 
quires the use of a noble metal (platinum or gold) incapable of 
chemical reaction with water. By this means the two liquids 
could be compared quantitatively from the purely mechanical 
point of view and equivalent vibration amplitudes determined. 
This would be useful in providing a proper basis for experiments 
with other metals using toluene to eliminate chemical reaction. 

Possible Mechanisms of Rupture. The effects illustrated in Fig. 6, 
where rupture of metal from the mass is seen to occur at the bot- 
tom of certain indentations and at the crests of some crater rims, 
suggest two possible mechanisms: 
1 When a metal is deformed by a spherical indenter the maxi-
mum value of the shear stress occurs at a point below the surface 
of the metal—not in the surface. The distance below the 
surface is about half the radius of the indentation circle when
under load—Timoshenko (1934), quoted by Davies (1949) [8]. 
This means that plastic deformation commences at a point cen- 
trally below the surface of each indentation. Consequently work- 
hardening is at a maximum below the surface of the indentation 
and this will give rise to the following results. The stress required 
to produce further yielding will increase and, secondly, the elastic 
recovery after the stress is removed will be correspondingly greater 
for the metal centrally below the surface of the indentation. It is 
this second point which seems to be important because it means 
that the effect of one final blow, after several blows in succession 
on one area, may be a sufficiently vigorous elastic recovery below 
the surface to cause cracking at the surface and possibly to “ex-
plode” off actual fragments of metal. The mechanism would
have something in common with the ejection of metal from the 
end of a bar struck with a sharp blow, due to reflection of the 
shock wave from the end of the bar. In this case the rebound of 
the elastic core is the ejector mechanism. 

2 With work-hardened materials particularly, there is an up- 
ward extrusion of displaced metal to form a raised crater rim 
around an indentation. The effect is known as “piling up.”
Even with an annealed material work-hardening will occur rela- 
tively soon. The acute bending of the metal at a sharp crater 
rim, may, of itself, give rise to cracking, as indicated in the NW 
area of Fig. 7(b). Secondly, when several indentations are closely 
contiguous, fragments of the piled-up material may be extruded 
between them and become detached. 


These two mechanisms may play an appreciable part in the 
physical removal of metal as a result of cavitation. 
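
The first mechanism rests on the stress field beneath a spherical indenter. As a rough check, the sketch below evaluates the classical Hertz elastic solution along the axis of contact and locates the depth of maximum shear stress. Poisson's ratio of 0.3, the function name, and the scan range are assumptions made for illustration; depths are expressed in units of the contact-circle radius and stresses in units of the peak contact pressure.

```python
import math

def max_axial_shear(poisson=0.3, steps=3000, max_depth=3.0):
    """Scan the contact axis; return (depth / contact radius, shear / peak pressure)."""
    best_depth, best_shear = 0.0, 0.0
    for i in range(1, steps + 1):
        z = max_depth * i / steps  # depth in units of the contact radius
        # Hertz stresses on the axis, normalized by the peak contact pressure.
        sigma_z = -1.0 / (1.0 + z * z)
        sigma_r = -(1.0 + poisson) * (1.0 - z * math.atan(1.0 / z)) + 0.5 / (1.0 + z * z)
        shear = 0.5 * abs(sigma_z - sigma_r)  # principal shear stress
        if shear > best_shear:
            best_depth, best_shear = z, shear
    return best_depth, best_shear

depth, shear = max_axial_shear()
print(f"maximum shear {shear:.2f} of peak pressure at depth {depth:.2f} contact radii")
```

For a Poisson's ratio of 0.3 the maximum lies a little under half a contact radius below the surface, at roughly 0.3 of the peak contact pressure. This is an elastic result and only indicates where yielding is expected to begin.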

Corrosive Oxidation Due to Temperature Rise. The suggestion 
made earlier, that the rise in temperature at indentation points, 
caused by the absorption of work, creates conditions favorable 
to energetic local chemical action between water and metal is 
strongly supported by the appearance of the cold-rolled iron sur- 
face illustrated in Fig. 9. 
(a) Showing apparent corrosion at nearly every indentation point, in water
(b) Corrosion at indentation points, seen at higher magnification, in water
An indication of the temperature rise to be expected may be
obtained from consideration of the amount of work absorbed, in 
relation to the heat capacity of the metal displaced. Knapp [1] 
arrived at a figure of 60,000 in-lb per cu in. of indentation volume 
for annealed aluminum, by calculation based on tests with a 
microhardness indenter. If the assumption is made that 85 per 
cent of the work appears as heat [9] and that at the instant when 
the indentation is completed the heat is confined within a volume 
of metal equal to the indentation, the instantaneous temperature 
rise for annealed aluminum is 137 C. The corresponding amount 
of work for other metals will be very nearly proportional to their 
hardness, compared with pure aluminum. Knapp’s figure 
probably related to VDH ~ 20 while the VDH figure for 
the cold-rolled iron used in these experiments was 165. Hence, 
the work absorbed per unit volume with the iron is about 5 × 10^5
in-lb per cu in. and the corresponding temperature rise, allowing for the
different density and specific heat, is 635 C. 
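
The arithmetic behind these estimates can be sketched as follows. The work figure, the 85 per cent heat fraction, and the hardness values come from the text; the densities and specific heats are assumed room-temperature handbook values, since the paper does not list the properties it used, and the names are illustrative.

```python
# Unit conversion: 1 in-lb per cu in. equals 1 psi, about 6.895e-3 J per cm^3.
J_PER_CM3_PER_INLB_PER_IN3 = 6.894757e-3

HEAT_FRACTION = 0.85          # fraction of work appearing as heat (from the text)
WORK_AL = 60_000.0            # in-lb per cu in., Knapp's figure for annealed aluminum
VDH_AL, VDH_FE = 20.0, 165.0  # hardness values quoted in the text

# Assumed handbook properties (not given in the paper):
# density in g/cm^3, specific heat in J/(g K).
PROPS = {"aluminum": (2.70, 0.90), "iron": (7.87, 0.45)}

def temperature_rise(work_inlb_per_in3, density, specific_heat):
    """Temperature rise if the heat stays within the indentation volume."""
    heat = HEAT_FRACTION * work_inlb_per_in3 * J_PER_CM3_PER_INLB_PER_IN3  # J/cm^3
    return heat / (density * specific_heat)

# Work per unit volume scaled in proportion to hardness, as the text proposes.
work_fe = WORK_AL * VDH_FE / VDH_AL
rise_al = temperature_rise(WORK_AL, *PROPS["aluminum"])
rise_fe = temperature_rise(work_fe, *PROPS["iron"])
print(f"iron work: {work_fe:.0f} in-lb per cu in.")
print(f"rise, aluminum: {rise_al:.0f} C; iron: {rise_fe:.0f} C")
```

With these assumed properties the sketch gives roughly 145 C for aluminum and 820 C for iron, against the 137 C and 635 C quoted. The gap presumably reflects different property values, for instance a specific heat averaged over the temperature range.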

An analysis of the temperature situation after completion of 
the indentation, using the equations of Carslaw and Jaeger [10] 
and assuming that the heat pulse is delivered in a period of 1 
microsec, shows that in 4 microsec the temperature will have fallen 
to about half, and in 9 microsec to about one-third of the original 
value. This occurs mainly by conduction of heat into the mass. 
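
The quoted cooling figures are consistent with a temperature that decays as the inverse square root of time, the form taken by an instantaneous plane heat pulse at the surface of a semi-infinite solid. Whether this is the particular Carslaw and Jaeger solution the author used is an assumption; the sketch below simply takes the temperature at the end of the 1-microsec pulse as the reference value.

```python
import math

PULSE_MICROSEC = 1.0  # duration of the heat pulse, from the text

def remaining_fraction(t_microsec, t_ref=PULSE_MICROSEC):
    """Temperature at time t as a fraction of its value at t_ref, assuming T ~ t**-0.5."""
    return math.sqrt(t_ref / t_microsec)

for t in (1.0, 4.0, 9.0):
    print(f"t = {t:.0f} microsec: {remaining_fraction(t):.2f} of the original value")
```

Under this assumption the fraction is one-half at 4 microsec and one-third at 9 microsec, matching the figures in the text.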

Now the condition of the iron surface at the point where an 
indentation has just occurred will be that any previously existing 
oxide film is cracked, possibly dislodged, with resultant ex- 
posure of new metal and probably dislocations. The work of 
Wickert & Piltz and Douglas & Zyzes [11] shows that in these 
circumstances, which resemble tests with iron powder or freshly 
etched iron, the rate of reaction between pure iron and pure 
water at a temperature between 300 and 400 C will be high. 
Consequently it is to be expected that the effects of chemical 
reaction will be apparent where indentation has occurred and this 
is what Fig. 9 shows. 

The contribution which this mechanism will make to the total 
loss of metal from the surface may be expected to vary with the 
severity of cavitation in the sense that, with very severe indenta- 
tion, the metal will run out of ductility relatively soon and fail 
by the type of rupture outlined as the first possible mechanism of 
rupture, whereas with cavitation of milder severity the cumula- 
tive effect of oxide formation at every indentation, and the subse- 
quent loss by fracture and dislodgment of relatively brittle oxide 
particles, combined with weakening due to pounding of oxide into 
the metal lattice, will preponderate. 
DISCUSSION
Saip Alpay³

An important contribution has been made by this author to 
the investigation of indentation of metals by cavitation. The 
color photomicrographs taken by the author visualized so clearly, 
as never before, the development of the damage. Furthermore, 
he suggests a possible mechanical explanation for the mechanism
of rupture and loss of metal. Most previous investigators have 
attributed cavitation to a mechanical effect and were not able 
to provide a satisfactory explanation for its development. They 
described it in general terms as a “water-hammer effect,” a
“mechanical shock,” or an “action of the vapor pockets, in par-
ticular, their subsequent condensation.” These expressions
were far from analyzing the whole effect of the destruction in 
different stages. Especially the first part of the suggested 
theory in this paper offers an excellent explanation of the mecha- 
nisms of rupture by cavitation and is in close accordance with the 
concept of mechanical destruction. 

In addition to the mechanical process, the author has observed 
that chemical reaction undoubtedly has occurred between iron 
and water and aluminum and water. But he prudently withholds 
this argument that loss of metal by cavitation is only a result of 
chemical action. In this paper, the rise of temperature, due to 
indentation, and the rate of chemical reaction are overstressed 
and their importance and significance in the whole process may 
be misunderstood, although the author has made a painstaking 
effort to avoid it, but a hurried generalization may lead to severe 
errors. The author has investigated the case of pure aluminum 
and pure iron in water; he has found that corrosion occurs as a 
secondary effect. It is highly improbable that the same result 
is to be obtained with other metals. 

In 1936, at the Technical University of Berlin, we demonstrated 
the primary mechanical nature of cavitation damage. The 
experiment consisted of running, at high speed in ordinary labora- 
tory water, a small brass model of a propeller, and producing 
severe cavitation. One fourth of the blade ends were destroyed,
literally “shaved,” in less than an hour. The destroyed section
showed only brilliant crystals to the naked eye, as if it had been 
cut off with a chisel and hammer. A piece of this section and 
another broken piece of the same specimen of metal, under a 
normal microscope, showed the same crystal pattern without 
any evidence of oxidation. It can be stated that in this case 
the loss of metal was not due to any chemical action. 

A similar experiment can be made with glass, which is inert 
to water, and cavitation damage can be observed. From these 
two experiments it can be concluded that the old concept of 
correlating cavitation damage with a chemical action has no 
chance of survival. 

Although cavitation is primarily a mechanical process, it 
cannot be denied that secondary consequences, such as a rise in 
temperature, chemical reaction, and electrolysis between strain 
points are taking place. In some cases, even the secondary 
effects might play an important role in the rupture of metal. 
But that must be understood, just as in the grinding process of 
metals, where the grinding is the main mechanical operation, 


8 Visiting Professor from Yildiz Technical School and The Techni- 
cal University of Istanbul, Turkey, University of Michigan, College 
of Engineering, Ann Arbor, Mich. 
but is more or less accompanied by a rise in temperature and a 
burning of the particles. 

Further investigations with different metals and alloys may 
clear the degree of importance of the secondary effects, but it is 
highly improbable that they would change the concept of the 
essentially mechanical nature of cavitation. 


S. Logan Kerr⁴ 


Dr. Wheeler has presented a very interesting extension of some 
of the work of the late Dr. Robert Knapp on the subject of the 
inception of cavitation. 

Some years ago Mr. Boetcher⁵ analyzed with microphotographs 
the cavitation damage that resulted from the test stand of the 
Pennsylvania Water and Power Company at Holtwood, Pa. 

Mr. Boetcher was one of the first to detect the intercrystalline 
fractures caused by cavitation and confirmed in many respects 
that the damage was primarily mechanical in nature. 

Considering again the basic mechanical nature of cavitation 
erosion, it should be noted that damage has been produced in a 
water circuit impinging upon quartz glass which is completely 
inert to chemical reactions. 

The exploratory work that the author has done is naturally of 
great interest and will push back the frontier of the unknown 
conditions in cavitation. It is evident, however, that the major 
factor in cavitation erosion is the mechanical and dynamic 
effect, and that the chemical or metallurgical changes are not the 
dominant cause but merely increase the magnitude of damage. 

The speculation as to the effect of temperature rise of the 
material itself is of considerable interest. It is also important 
to recognize the variation in cavitation damage as the function 
of the vapor pressure of the fluid. In the writer's early work⁶ 
using the vibratory method as a part of the program carried out 
at the Massachusetts Institute of Technology in 1935 and 1936, 
this particular effect was studied with some care. 

In the test made with both fresh water and sea water the effect 
of temperature was examined at some five points between 3 and 
50 deg C. It was significant that the damage to brass specimens 
reached a maximum at 20 to 30 C whereas for cast irons and steels 
the maximum damage conditions appeared to be at or about 50 C. 
The destructive action with temperature appeared to increase 
from low temperature points up to a maximum and then decrease 
until a temperature was reached where practically no damage 
took place even after 90 minutes of exposure in the vibratory 
apparatus. 

The author does not indicate whether the temperature was 
precisely controlled in his studies and this may have some sub- 
stantial influence on the results. 

It is suggested that Figs. 5, 6, 7, 8, and 9 should have the test 
liquid identified for convenience of the reader. It is noted that 
Fig. 9 is not identified as to fluid either in the illustration or the 
text. 

It might also be stated that Dr. Wheeler’s investigation proba- 
bly brings to light one of the steps in fatigue of the material sub- 
ject to cavitation. Again in the writer's investigation previously 
referred to, it was suggested that the vibratory method of measur- 
ing cavitation erosion was also a possible means for establishing 
fatigue limits in materials. If this is coupled with the use of 
corrosive liquids as investigated by the author, this may be a 
further means of evaluating fatigue limits where corrosive factors 
are also present. 

4 Consulting Engineer, Flourtown, Pa. Fellow ASME. 

5 H. N. Boetcher, "Failure of Metals Due to Cavitation Under 
Experimental Conditions," Trans. ASME, vol. 58, 1936, paper 
HYD-58-1, pp. 355-360. 

6 "Determination of the Relative Resistance to Cavitation Erosion 
by the Vibratory Method," Trans. ASME, vol. 59, 1937, pp. 373-397. 
W. C. Leith⁷ 


The analysis of the mechanism of cavitation erosion as the 
ejection of brittle strain-hardened oxide layers from the metal 
surface by a later blow releasing the stored elastic energy beneath 
the oxide layer, certainly accounts for the crater-shaped pits and 
the irregular eroded fragments that are evident in accelerated 
cavitation tests. The suggestion that toluene may be considered 
as a chemically inert counterpart of water for cavitation testing 
purposes is valid with respect to their relative vapor pressures, 
but Novotny in his German paper (Destruction of Materials by 
Cavitation, 1942) reported that the increased weight loss in water 
was due to the surface tension effect. It is interesting to note that 
water shows more than double the weight loss, and the surface 
tension of water at room temperature is more than double that of 
toluene. Have you considered high speed photography to check 
whether the size and number of cavitation bubbles are compa- 
rable in water and toluene? 


Wm. J. Rheingans⁸ 


The author has performed some excellent research and ex- 
perimental work in obtaining a visual record of the mechanical 
damage produced by cavitation. There has been considerable 
discussion in the past regarding the possibility of chemical action 
in connection with cavitation damage on materials. The author 
has definitely established that there is a certain amount of 
chemical action, which accelerates the removal of metal. He 
suggests that, in order to get a true comparison of the resistance 
of various materials to cavitation damage, the tests be made with 
a liquid like toluene which is chemically inert. It would be in- 
teresting to run a series of tests on various materials using this 
liquid to determine whether chemical action has an appreciable 
effect on their relative resistances. However, since most of our 
cavitation damage problems are in connection with materials ex- 
posed to water cavitation, accelerated tests with water would 
give a better indication of the relative merits of the materials for 
such usage. 

When we recognize that chemical action plays an important 
part in cavitation damage, we have one important factor unre- 
solved in comparing accelerated tests with field performance. 
This is the time factor. The chemical action due to temperature 
rise at the indentation point is conceivably the same in both ac- 
celerated tests and in field performance. However, since ac- 
celerated tests of two hours’ duration sometimes represent several 
years of cavitation action in the field, there is the question of the 
part chemical action plays in field erosion due to the large dif- 
ference in the time scale and the consequent large interval of time 
between indentation blows. 


Irving Taylor⁹ 


The author is to be commended for persistence in seeking to 
discover more steps in the cavitation damage process, and for 
worthy efforts to delineate local chemical actions from mechanical 
actions therein. 

In his last sentence one part seems questionable. He speaks of 
metal weakening "due to pounding of oxide into the metal 
lattice." It is possible that particles of oxide might be pounded 
into very small crevices but hardly into the metal lattice. Mole- 
cules of oxide are not small enough to penetrate between atoms 
of iron, or between atoms of iron and any other alloying element, 


7 Mechanical Research Engineer, Dominion Engineering Works, 
Limited, Montreal, Canada. Mem. ASME. 

8 Assistant General Manager, Engineering, Hydraulic Division, 
Allis-Chalmers Manufacturing Company, York, Pa. Mem. ASME. 

9 Engineering/Research Specialist, Marquardt Aircraft Company, 
Ogden, Utah. Mem. ASME. 
in the lattices. Practically the only atoms small enough to do 
this are free hydrogen atoms. 

Strangely enough, free hydrogen atoms quite possibly do penetrate 
the solid's lattice at the cavitation bubble collapse region.¹⁰ 
For very hard and strong metals subjected to cavitation in water 
or in toluene, if the solid surface will not yield (become indented) 
under the impacts, the reflection of the shock waves at the 
liquid-solid interface should have a shattering effect on the 
layer of liquid molecules that are partially bonded to the solid 
surface atoms. Thus hydrogen atoms, broken free from water or 
from toluene by partial dissociations from impacts of pressure 
shocks due to nearby cavity-collapse-impacts, may in fact be 
driven into the metal lattice. 

If we imagine the temperature of the infinitesimal liquid layer 
closest to the solid surface at the instant when a nearby cavity- 
collapse shock wave reaches the interface and has to be reflected 
rather than transmitted, this very local temperature probably 
reaches far above the temperatures required for dissociation. 
The dissociations will not necessarily produce ions; un-ionized 
free radicals and free atoms are just as likely, perhaps more so. 
(Chemical actions are possible here too, since such fragments 
are not inert.) And the force to push some free hydrogen atoms 
into the metal lattice is present in the same shock wave that does 
the dissociating, plus that from rapidly succeeding shock waves. 

Once in the lattice and moved around by thermal persuasions, 
the hydrogen atoms may sooner or later each acquire an extra 
electron, and become hydrogen ions—of larger size than the free 
atoms—and less free to move within the lattice. Also some may 


10 I. Taylor, discussion of paper "On the Mechanism of Cavitation 
Damage," by Plesset & Ellis, Trans. ASME, vol. 77, 1955, p. 1064. 




join each other to form hydrogen molecules. Thus the mechanism 
for incipient "fatigue cracking" or embrittlement of the solid 
near the cavity collapse region is presumed. 

The author would seem to imply from his two proposed possible 
mechanisms of rupture, that incubation periods or induction 
periods in cavitation testing are the times taken up by the in- 
dentation phase or the oxidation phase—before the overlappings 
of indentations and the dislodgments begin to create a measurable 
weight loss. This discusser would suggest that incubation periods 
may also be the times taken up by the penetrations and first 
accumulations in the metal lattice of hydrogen atoms, before the 
fatigue cracking and dislodgings can begin. 

Can it be assumed that indentation is necessarily the first stage 
in the cavitation damage process? When considering aluminum 
or cold rolled pure iron in water or in toluene, probably it is. Even 
considering harder and stronger materials in liquid metals under 
cavitation, indentation may be the first stage. Yet the mecha- 
nism that damages stellite, austenitic stainless steel, or titanium 
alloys, after quite long incubation periods in an accelerated test- 
ing apparatus using water or toluene, may be some other. 


Author's Closure 


I greatly appreciate and value the comments made by the dis- 
cussers and sincerely regret my inability to pursue their points 
further or answer them properly at this present moment, owing to 
absence of my records and results due to dismantling, packing, 
and shipment of the Laboratory, associated with the assumption 
of new duties for the U. K. Government. But I hope to deal with 
them in a future publication which will be concerned particularly 
with the measured temperature rise at indentation points. 
A Shock-Tube Technique to Determine 
Steady-Flow Losses of Orifices 
and Other Duct Elements¹ 


It is shown that a simple shock tube is capable of producing appreciable steady-flow 
rates through a short duct element, such as an orifice, a valve, or a screen. The flow 
upstream and downstream of the test element and, therefore, also the losses caused by 
the test element, can be calculated from known initial conditions in the shock tube and 
pressure measurements at one point upstream of the element. Experiments to determine 
the discharge coefficient of a sharp-edged orifice are described as an illustration of the 
method. The results are in good agreement with available steady-flow data. 


GEORGE RUDINGER 


Principal Physicist, Cornell 
Aeronautical Laboratory, 
Inc., Buffalo, N. Y. 


1 Introduction 


IT IS FREQUENTLY DESIRED to determine the losses 
in a gas flow caused by certain elements in a duct system, such 
as valves, screens, orifices, or short bends. In a steady flow, the 
performance of such elements can be measured readily, but a 
gas-supply system for the high flow rates that are sometimes re- 
quired may not be available. It will be shown that a simple shock 
tube often is capable of providing the desired flow rates and pres- 
sures for a short time. 

The principle of the method and the calculation of the flow 
upstream and downstream of the test element will be described in 
some detail, but reference to other publications, for instance, 
[1, 2, 3],² must be made with regard to fundamentals of shock- 
tube operation and the analysis of nonsteady flows by means of 
wave diagrams based on the method of characteristics. 


Fig. 1 shows the various components of a shock tube with the 
element to be tested located some distance from the downstream 
end of the expansion chamber. To produce the proper flow, the 
shock-tube diameters upstream and downstream of the test element 
must be the same as those of the duct in which it is ultimately 
to be used. A pressure transducer is mounted a short distance 
upstream of the element to allow recording of the pressure 
variations caused by the incident shock wave and its reflection 
from the test element. 


1 This work was carried out as part of Project SQUID which is 
supported by the Office of Naval Research under Contract Nonr- 
1858(25). Reproduction in full or in part is permitted for any use of 
the United States Government. 


2 Numbers in brackets designate References at end of paper. 
Fig. 1 Sketch of shock-tube arrangement and wave diagram 


Contributed by the Fluid Mechanics Subcommittee of the Hy- 
draulic Division, and presented at the Hydraulic Conference, Ann 
Arbor, Mich., April 13-15, 1959, of THE AMERICAN SOCIETY OF 
MECHANICAL ENGINEERS. 

NOTE: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, Decem- 
ber 19, 1958. Paper No. 59—Hyd-13. 


Nomenclature 

A = duct area upstream of test element, sq in. 
$A'$ = duct area downstream of test element, sq in. 
a = speed of sound, fps 
C = discharge coefficient, ratio of actual to theoretical flow rate 
$c_v$ = specific heat at constant volume 
$D_o$ = orifice diameter, in. 
G = flow rate, lb per sec 
g = acceleration of gravity, 32.2 fps² 
K = ASME flow coefficient for incompressible fluid 
M = Mach number 
m = orifice-area ratio 
p = absolute pressure, psi 
r = $p_6/p_5$, pressure ratio across orifice 
T = absolute temperature, deg R 
u = flow velocity, fps 
w = velocity of incident shock wave, fps 
Y = ASME expansion factor 
$\beta$ = ratio of orifice diameter to duct diameter 
$\gamma$ = ratio of specific heats 
$\rho$ = density, pcf 

Subscripts 
s = stagnation conditions 
j = conditions in jet discharge from orifice 
1, 2, ... 7 = steady-flow regions as indicated in Fig. 1 


A sketch of the wave pattern subsequent to the breaking of the 
shock-tube diaphragm is shown in the upper portion of Fig. 1. 
Steady flow through the test element is produced after the incident 
shock wave has been reflected, but this flow is not established 
instantaneously. The transients that occur when a shock wave is 
reflected from an orifice at the end of a duct have recently been 
described by Rudinger [4]. As an incidental result of this study, 
effective orifice areas, or discharge coefficients, were obtained, 
and it is the purpose of the present paper to discuss this aspect 
of the experiments in more detail. Steady flow is established 
through the mechanism of a complicated system of diffracted 
waves which may extend somewhat beyond the length of the test 
element. The time required to reach steady flow depends on the 
effective length of this wave system. It thus affects the various 
shock-tube sections, which must be of sufficient length to insure 
that the pressure in the final steady flow can be measured before 
any interference from waves, that are reflections from the inter- 
face or one of the ends of the duct, occurs. To prevent the need 
for excessively long shock tubes, the test element, therefore, must 
be short, and a length of a few duct diameters probably represents 
a reasonable upper limit. Since the final flow for a given element 
is independent of the starting details, the flow analysis can be 
carried out as if steady flow were instantaneously established, as 
indicated in Fig. 1. 

Somewhat similar shock-tube studies have been carried out in 
the past, but the flow conditions upstream and downstream of the 
test element were mostly inferred from measured propagation 
velocities of the transmitted or reflected waves. Because of the 
lag in the establishment of steady flow, these waves are not pure 
shocks but have a more complicated structure, for instance, a 
shock followed by an expansion wave (see Section 4). Since there 
is no relation between the total flow change produced by a wave 
and the propagation velocity of its head—except for a pure shock 
wave—any technique based on measurements of wave velocities 
may be subject to systematic errors that are not evaluated readily. 
If one is dealing with test elements such as fine-mesh screens, the 
disturbances produced by the incident wave are almost uni- 
formly distributed over the cross section of the duct. The effects 
of the transient phenomena are then minimized, although they 
may be noticeable for surprisingly large distances from the test 
element [4]. Several studies of shock-wave interactions with 
screens have been published. A schlieren streak record by Glass 
and Patterson (plate 18 of reference [1]) clearly shows a slowing 
down of the reflected shock wave. Bowman and Niblett [5] de- 
termined wave velocities from schlieren photographs and con- 
cluded from some discrepancies of the resulting screen drag that 
the strength of the reflected wave probably had been over- 
estimated. In addition to their explanations, the effect of the 
starting transient may also have been a contributing factor. 
Franks and Hall [6] determined wave speeds from schlieren streak 
records and density from interferometric measurements. They 
obtained quite good agreement with screen drag measured in 
steady flow. Dosanjh [7] used hot-wire probes to measure wave 
velocities, and Barthel [8] determined densities interferometri- 
cally. Both authors briefly discussed screen drag, but they were 
mainly concerned with the various possible flow patterns down- 
stream of the screens. 

Kawamura and Kawada [9] observed the retardations of shock 
waves caused by various bluff obstacles in their path and related 
these to the drag of the models. They noted that the transient 
phenomena associated with the establishment of steady flow 
around a model introduced some uncertainty into their measure- 
ments. In the already mentioned study of transients [4], the 
pressures behind the incident and reflected waves were measured 
directly, so that the portion corresponding to the transient phe- 
nomena could always be recognized. 


2 Principle of the Method 


The steady-flow regions in Fig. 1 are separated by waves and 
interfaces and are identified by numbers 1 to 7. Regions 1 to 4 
represent the basic shock-tube pattern; that is, the condi- 
tions ahead of and behind the incident shock wave, and behind and 
ahead of the expansion wave in the driver gas, respectively. 
Regions 5 and 6 represent the flow upstream and downstream of the 
test element. Region 7 represents the gas which initially was 
downstream of the test element, and is separated by an interface 
from region 6 because of the entropy rise resulting from the losses 
in the flow through the test element. The flow conditions at 
any point are determined by the flow velocity u and by the state 
of the gas. The latter is most conveniently described by the 
speed of sound $a$ and the absolute pressure $p$. Other flow variables, 
such as density $\rho$ and temperature $T$, can then be computed 
from the basic relations between the variables concerned. 

The incident and reflected waves separate regions of steady 
flow and are thus simple waves. The change of one flow variable 
across such a wave is sufficient to determine the change of all of 
them if it is either a pure shock wave or if all changes of state are 
isentropic [2,3]. Because of the already mentioned transient as 
the flow is established, the reflected and transmitted waves are in 
general not pure shocks. Since a shock wave merging with 
another wave produces no reflected waves only if the changes of 
state are isentropic, the experiments must be limited to sufficiently 
weak shock waves. Restriction to such waves is also desirable to 
avoid significant effects of boundary-layer development with 
consequent attenuation of the shock waves and shock wave- 
boundary layer interactions [10, 11]. Within the accuracy re- 
quired here, shocks with pressure ratios up to about 2.5 can be 
considered as producing isentropic changes of state. It will be 
seen in Section 3 that appreciable flow rates can be produced in 
spite of this limitation on the shock strength. 

Because of the limitation to weak shock waves, the established 
steady flow is subsonic both upstream and downstream of the test 
element. This condition must be satisfied in any case, because 
no deductions can be made in any supersonic flow about conditions 
downstream of the point of observation. It is possible for the 
flow to become sonic or supersonic within the test element, or in a 
region limited to its immediate vicinity, but the accompanying 
shock losses are then included in the measured losses of the test 
element. 

The assumptions on which the following analysis is based are 
that one is dealing with a quasi-one-dimensional flow of an ideal 
gas with constant specific heat, that there is no heat exchange 
with the walls, and that the effects of wall friction in the duct 
(but not in the test element) can be neglected. The last assumption 
requires that the pressure measurement not be made too far 
upstream of the test element in order to avoid a significant pres- 
sure change between the transducer location and the test element. 
This location corresponds, essentially, to the location of an up- 
stream pipe tap recommended by the ASME rules for orifice 
meters [12]. 

The flow upstream and downstream of the test element can be 
determined from the initial state of the gas in region 1 and the 
pressures in regions 2 and 5. Because of the condition that 
changes of state be isentropic, the sound speeds in regions 2 and 
5 can be obtained from the relations 

$$\frac{a_2}{a_1} = \left(\frac{p_2}{p_1}\right)^{(\gamma-1)/2\gamma} \tag{1}$$

and 

$$\frac{a_5}{a_1} = \left(\frac{p_5}{p_1}\right)^{(\gamma-1)/2\gamma} \tag{2}$$

where $\gamma$ is the ratio of the specific heats. In addition, one has the 
characteristic relation for the change of flow conditions across a 
simple wave [2, 3] 

$$\frac{2}{\gamma-1}\,a \pm u = \text{const} \tag{3}$$

where the upper sign applies to waves traveling in the direction 
of a negative flow velocity, such as that of the reflected wave, 
and the lower sign applies to waves traveling in the opposite 
direction, such as that of the incident shock wave. Since $u_1 = 0$, 
one obtains 

$$u_2 = \frac{2}{\gamma-1}\,(a_2 - a_1) \tag{4}$$

and 

$$u_5 = u_2 - \frac{2}{\gamma-1}\,(a_5 - a_2) = \frac{2}{\gamma-1}\,(2a_2 - a_1 - a_5) \tag{5}$$

The flow upstream of the test element is thus determined by $p_5$ 
and Equations (1), (2), and (5). 
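
The chain from measured pressures to the upstream flow can be sketched in a few lines of Python. This is only an illustration under the assumptions stated in the text (isentropic changes of state, gas initially at rest in region 1); the function name `upstream_state` and the sample pressures are hypothetical and are not taken from the experiments.

```python
def upstream_state(p1, a1, p2, p5, gamma=1.4):
    """Return (a2, u2, a5, u5) from Eqs. (1), (2), (4), and (5).

    p1, a1 : initial pressure (psi) and speed of sound (fps) in region 1
    p2, p5 : measured pressures behind the incident and reflected waves (psi)
    """
    n = (gamma - 1.0) / (2.0 * gamma)   # isentropic exponent for a vs. p
    k = 2.0 / (gamma - 1.0)             # coefficient in the characteristic relation
    a2 = a1 * (p2 / p1) ** n            # Eq. (1)
    a5 = a1 * (p5 / p1) ** n            # Eq. (2)
    u2 = k * (a2 - a1)                  # Eq. (4), using u1 = 0
    u5 = k * (2.0 * a2 - a1 - a5)       # Eq. (5)
    return a2, u2, a5, u5


# Hypothetical readings: atmospheric start, p2 = 22.0 psi, p5 = 26.0 psi.
a2, u2, a5, u5 = upstream_state(p1=14.7, a1=1139.0, p2=22.0, p5=26.0)
print(f"a2 = {a2:.1f} fps, u2 = {u2:.1f} fps, a5 = {a5:.1f} fps, u5 = {u5:.1f} fps")
```

When `p5` equals `p2` the function returns `u5 == u2`, which corresponds to the case of no reflected wave; a reflected compression (`p5 > p2`) lowers the upstream velocity.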

If the test element is mounted at the end of the duct, a jet 
discharging into an atmosphere of pressure $p_j$ is established. The 
performance of the test element can then be evaluated by com- 
paring the measured flow rate with the flow rate through an 
ideal test element, computed from the upstream stagnation pres- 
sure and temperature (see Section 4). 

If the test element is not located at the end of the duct, the 
downstream conditions remain to be found. Let $A$ and $A'$ denote 
the cross-sectional areas of the duct upstream and downstream, 
respectively, of the test element. Conservation of mass 
flow can then be expressed by 

$$\frac{p_5 A u_5}{a_5^2} = \frac{p_6 A' u_6}{a_6^2} \tag{6}$$

Because of the assumption that no heat exchange with the walls 
takes place, the stagnation temperature of the flow and therefore 
also the stagnation speed of sound remain constant through the 
test element, or 

$$\frac{2}{\gamma-1}\,a_5^2 + u_5^2 = \frac{2}{\gamma-1}\,a_6^2 + u_6^2 \tag{7}$$

The pressure and velocity must be continuous across the interface 
between regions 6 and 7, and the relations that hold across 
the transmitted shock wave are analogous to Equations (1) and 
(4) so that the relation 

$$u_6 = \frac{2}{\gamma-1}\,(a_7 - a_1) = \frac{2}{\gamma-1}\,a_1\left[\left(\frac{p_6}{p_1}\right)^{(\gamma-1)/2\gamma} - 1\right] \tag{8}$$

must hold, where the substitutions $u_7 = u_6$ and $p_7 = p_6$ have been 
made. The system of Equations (6), (7), and (8) for the unknowns 
$a_6$, $p_6$, and $u_6$ is best solved by iteration. One might 
estimate the value of $p_6$, calculate $u_6$ and $a_6$ from Equations (7) 
and (8), and repeat this procedure until Equation (6) is satisfied. 

The flow conditions in regions 5 and 6 determine the per- 
formance of the test element. The losses may be expressed, for 
instance, by the ratio of the downstream to upstream stagnation 
pressures, $p_{s6}/p_{s5}$. Closely related to this ratio is the entropy rise 
from region 5 to region 6. Other performance criteria also might 
be used. As an example, the determination of orifice coefficients 
is described in Section 4. 
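
The iteration for the downstream state can be sketched as follows. This is a minimal illustration, not the procedure actually used for the paper: it assumes the mass balance is written in terms of $p u / a^2$ (density eliminated through the speed of sound), it replaces the manual estimate-and-repeat loop by a bisection on $p_6$, and the name `downstream_state` and its arguments are hypothetical.

```python
def downstream_state(p1, a1, p5, a5, u5, area_ratio=1.0, gamma=1.4, tol=1e-10):
    """Solve Eqs. (6)-(8) for (p6, a6, u6) by bisection on p6.

    area_ratio is A'/A. Pressures in psi, speeds in fps.
    """
    n = (gamma - 1.0) / (2.0 * gamma)
    k = 2.0 / (gamma - 1.0)
    energy = k * a5 ** 2 + u5 ** 2          # left side of Eq. (7)
    target = p5 * u5 / a5 ** 2              # left side of Eq. (6), divided by A

    def state(p6):
        u6 = k * a1 * ((p6 / p1) ** n - 1.0)    # Eq. (8), transmitted wave
        a6_sq = (energy - u6 ** 2) / k          # Eq. (7) solved for a6^2
        return u6, a6_sq

    def residual(p6):
        u6, a6_sq = state(p6)
        if a6_sq <= 0.0:                        # guess far too high
            return float("inf")
        return area_ratio * p6 * u6 / a6_sq - target   # Eq. (6) imbalance

    lo, hi = p1, max(p5, p1)
    while residual(hi) < 0.0:                   # widen the bracket if needed
        hi *= 1.5
    while hi - lo > tol * p1:
        mid = 0.5 * (lo + hi)
        if residual(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    p6 = 0.5 * (lo + hi)
    u6, a6_sq = state(p6)
    return p6, a6_sq ** 0.5, u6


# Hypothetical upstream state: p5 = 1.8 p1 behind a reflected compression.
p1, a1, g = 14.7, 1139.0, 1.4
n = (g - 1.0) / (2.0 * g)
a2 = a1 * 1.5 ** n
a5 = a1 * 1.8 ** n
u5 = 2.0 / (g - 1.0) * (2.0 * a2 - a1 - a5)
p6, a6, u6 = downstream_state(p1, a1, 1.8 * p1, a5, u5)
print(f"p6 = {p6:.2f} psi, a6 = {a6:.1f} fps, u6 = {u6:.1f} fps")
```

Bisection is used only because the mass-flow imbalance increases monotonically with the guessed $p_6$ over the subsonic range considered here; any other root finder would serve equally well.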


3 The Shock Tube as a Gas-Supply System 


If a loss-free test element is mounted in a constant-area duct 
(A’ = A), the flows in regions 5, 6, and 7 of Fig. 1 are, clearly, 
identical. It follows then from Equations (5) and (8) that $a_2$ and 
$a_5$ are also equal. Consequently, there is no reflected wave under 
these circumstances, and the flow rate produced by the incident 
shock wave passes unmodified through the test element. One 
may thus consider the flow rate in region 2 as representative of 
the capabilities of a shock tube as a gas-supply system. 
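
The step from identical flows in regions 5, 6, and 7 to the absence of a reflected wave can be written out explicitly. The short derivation below is a sketch that assumes the forms of Equations (5) and (8) implied by the characteristic relation, with $u_7 = u_5$ and $a_7 = a_5$ for the loss-free element.

```latex
\begin{align*}
u_5 &= \frac{2}{\gamma-1}\,(2a_2 - a_1 - a_5)
      && \text{Eq. (5)} \\
u_7 &= \frac{2}{\gamma-1}\,(a_7 - a_1)
      && \text{Eq. (8)} \\
u_5 = u_7,\; a_5 = a_7
    &\;\Longrightarrow\; a_5 - a_1 = 2a_2 - a_1 - a_5
     \;\Longrightarrow\; a_5 = a_2 .
\end{align*}
```

With $a_5 = a_2$, Equations (1) and (2) give $p_5 = p_2$ and Equation (5) gives $u_5 = u_2$, so regions 2 and 5 are the same state and no wave separates them.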

Let $G_2$ denote the flow rate behind the incident shock wave. 
The flow per unit area is then given by 

$$\frac{G_2}{A} = \frac{\rho_2 u_2}{144} = \frac{g\gamma\,p_2 M_2}{a_2} \tag{9}$$

where $g$ is the acceleration of gravity, and $M = u/a$ is the Mach 
number. In the last term, the density has been eliminated with 
the aid of the relation for the speed of sound, $a^2 = 144 g\gamma p/\rho$. 
The factors $g$ and 144 are the result of the particular units used 
(see Nomenclature). Since the analysis is restricted to weak 
shock waves for which changes of state can be treated as isentropic, 
Equations (1) and (4) may be used to express the Mach 
number and flow rate in terms of the pressure ratio in the form 

$$M_2 = \frac{2}{\gamma-1}\left[1 - \left(\frac{p_1}{p_2}\right)^{(\gamma-1)/2\gamma}\right] \tag{10}$$

and 

$$\frac{G_2}{A} = \frac{2g\gamma}{\gamma-1}\,\frac{p_1}{a_1}\left(\frac{p_2}{p_1}\right)^{(\gamma+1)/2\gamma}\left[1 - \left(\frac{p_1}{p_2}\right)^{(\gamma-1)/2\gamma}\right] \tag{11}$$

These relations are plotted in Fig. 2 for air ($\gamma = 1.4$). The 
curve for the flow rate is based on an initial temperature of 540 
deg R, corresponding to $a_1 = 1139$ fps, and on $p_1 = 14.7$ psi to give 
an indication of the flows that can be obtained with initially 
atmospheric conditions. 
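
The curves of Fig. 2 can be regenerated numerically. The sketch below assumes the isentropic weak-shock forms of Equations (9) to (11) and the units of the Nomenclature (psi, fps, lb per sec per sq in.); the function names are hypothetical. Evaluating the flow rate both from Equation (9) and from the closed form (11) gives a check that the two expressions agree.

```python
G_ACCEL = 32.2  # acceleration of gravity, fps^2, value given in the Nomenclature


def mach_behind_shock(p21, gamma=1.4):
    """Mach number M2 behind the incident shock, Eq. (10); p21 = p2/p1."""
    n = (gamma - 1.0) / (2.0 * gamma)
    return 2.0 / (gamma - 1.0) * (1.0 - p21 ** (-n))


def flow_per_area_eq9(p21, p1=14.7, a1=1139.0, gamma=1.4):
    """G2/A in lb per sec per sq in. from Eq. (9), with a2 from Eq. (1)."""
    n = (gamma - 1.0) / (2.0 * gamma)
    a2 = a1 * p21 ** n
    return G_ACCEL * gamma * (p21 * p1) * mach_behind_shock(p21, gamma) / a2


def flow_per_area_eq11(p21, p1=14.7, a1=1139.0, gamma=1.4):
    """G2/A from the closed form, Eq. (11)."""
    n = (gamma - 1.0) / (2.0 * gamma)
    m = (gamma + 1.0) / (2.0 * gamma)
    return (2.0 * G_ACCEL * gamma / (gamma - 1.0)) * (p1 / a1) \
        * p21 ** m * (1.0 - p21 ** (-n))


# Initially atmospheric air at 540 deg R, as assumed for Fig. 2.
for p21 in (1.2, 1.5, 2.0, 2.5):
    print(f"p2/p1 = {p21:.1f}  M2 = {mach_behind_shock(p21):.3f}  "
          f"G2/A = {flow_per_area_eq9(p21):.3f} lb/sec/sq in.")
```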

It is also of interest to determine the shock-tube conditions 
required to produce a prescribed flow rate and stagnation pressure. 
The latter is given by 

$$p_{s2} = p_2\left(1 + \frac{\gamma-1}{2}\,M_2^2\right)^{\gamma/(\gamma-1)} \tag{12}$$

so that $p_2$ can be eliminated in Equation (9), and one obtains 

$$\frac{G_2}{A\,p_{s2}} = \frac{g\gamma\,M_2}{a_1\,(a_2/a_1)}\left(1 + \frac{\gamma-1}{2}\,M_2^2\right)^{-\gamma/(\gamma-1)} \tag{13}$$

where $a_2/a_1$ and $M_2$ are functions of the shock pressure ratio 
according to Equations (1) and (10). It is, thus, possible to plot 
$G_2/Ap_{s2}$ and $p_{s2}/p_1$ as functions of $p_2/p_1$, as shown in Fig. 3 for an 
initial temperature of 540 deg R and $\gamma = 1.4$. 

The use of Fig. 3 is illustrated by the following problem. What 
are the shock strength and initial pressure required to produce an 
air flow of 10 lb per sec through a duct of 10 sq in. with a stagnation 
pressure of 100 psi? The answer is indicated in Fig. 3 by the broken 
line. For $G_2/Ap_{s2} = 0.01$ sec$^{-1}$, one obtains $p_2/p_1 = 1.51$, and for 
this shock pressure ratio, the other curve yields $p_{s2}/p_1 = 1.59$. 
Consequently, the initial pressure in the shock tube must be 
$p_1 = 100/1.59 = 62.9$ psi. 

Fig. 2 Typical performance of a shock tube as an air-supply system 

Fig. 3 Chart for computing shock-tube conditions required for given 
flow rate and stagnation pressure for $\gamma = 1.4$ and $T_1 = 540$ deg R. 
Broken line illustrates example discussed in the text. 
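
The chart lookup in this example can be mimicked numerically. The sketch below is an assumption-laden stand-in for reading Fig. 3: it evaluates the flow parameter of Equation (13) in its isentropic weak-shock form and inverts it by bisection. The function names and the bracketing interval are hypothetical choices.

```python
G_ACCEL = 32.2  # fps^2, from the Nomenclature


def mach_and_sound_ratio(p21, gamma=1.4):
    """Return (M2, a2/a1) for shock pressure ratio p21, Eqs. (10) and (1)."""
    n = (gamma - 1.0) / (2.0 * gamma)
    return 2.0 / (gamma - 1.0) * (1.0 - p21 ** (-n)), p21 ** n


def stagnation_ratio(p21, gamma=1.4):
    """p_s2/p1 from Eq. (12)."""
    m2, _ = mach_and_sound_ratio(p21, gamma)
    return p21 * (1.0 + 0.5 * (gamma - 1.0) * m2 ** 2) ** (gamma / (gamma - 1.0))


def flow_parameter(p21, a1=1139.0, gamma=1.4):
    """G2/(A p_s2) in 1/sec from Eq. (13)."""
    m2, a21 = mach_and_sound_ratio(p21, gamma)
    recovery = (1.0 + 0.5 * (gamma - 1.0) * m2 ** 2) ** (-gamma / (gamma - 1.0))
    return G_ACCEL * gamma * m2 / (a1 * a21) * recovery


def shock_ratio_for(target, lo=1.0, hi=2.5, iterations=60):
    """Invert flow_parameter by bisection (it increases with p21 on [lo, hi])."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if flow_parameter(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Example from the text: 10 lb per sec, 10 sq in., 100 psi stagnation pressure.
p21 = shock_ratio_for(10.0 / (10.0 * 100.0))
ps_ratio = stagnation_ratio(p21)
p1 = 100.0 / ps_ratio
print(f"p2/p1 = {p21:.3f}, ps2/p1 = {ps_ratio:.3f}, p1 = {p1:.1f} psi")
```

The computed values fall within chart-reading accuracy of the figures quoted in the example.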

It remains to determine the initial pressure behind the shock- 
tube diaphragm $p_4$ to produce the desired shock strength. 
Elementary shock-tube theory [1] yields the relation 

$$\frac{p_4}{p_1} = \frac{p_2}{p_1}\left[1 - \left(\frac{p_2}{p_1} - 1\right)\sqrt{\frac{\dfrac{\gamma_4-1}{2\gamma_4}\,\dfrac{c_{v1}T_1}{c_{v4}T_4}}{\dfrac{\gamma_1+1}{\gamma_1-1}\,\dfrac{p_2}{p_1} + 1}}\;\right]^{-2\gamma_4/(\gamma_4-1)} \tag{14}$$

where $c_v$ is the specific heat at constant volume. This formula is 
quite general and applies also if the driver gas is not the same as 
the driven gas and if these gases have different initial temperatures. 
It has been the general experience [1] that the actual 
shock strength is well predicted by Equation (14) as long as one 
is dealing with such weak shocks as are considered here. If one 
assumes in the foregoing example that air of 540 deg R is used as 
the driver gas, Equation (14) yields $p_4/p_1 = 2.40$, or $p_4 = 151.0$ 
psi. 
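
A numerical evaluation of the driver pressure may be sketched as below. The code assumes the commonly quoted form of the elementary shock-tube relation, written with the ratio $c_{v1}T_1/c_{v4}T_4$ so that different driver gases and temperatures can be represented; the function name and argument names are hypothetical. For $p_2/p_1 = 1.51$ with air at equal temperatures on both sides this form gives a driver ratio of roughly 2.3 to 2.4, the same neighborhood as the value quoted in the text.

```python
import math


def driver_pressure_ratio(p21, gamma1=1.4, gamma4=1.4, cv_t_ratio=1.0):
    """Return p4/p1 needed for shock pressure ratio p21 = p2/p1.

    gamma1, gamma4 : specific-heat ratios of driven and driver gas
    cv_t_ratio     : (c_v1 * T1) / (c_v4 * T4); 1.0 for the same gas and temperature
    """
    beta4 = (gamma4 - 1.0) / (2.0 * gamma4)
    alpha1 = (gamma1 + 1.0) / (gamma1 - 1.0)
    term = (p21 - 1.0) * math.sqrt(beta4 * cv_t_ratio / (alpha1 * p21 + 1.0))
    return p21 * (1.0 - term) ** (-1.0 / beta4)


# Example from the text: air driving air, both at 540 deg R, p1 = 62.9 psi.
ratio = driver_pressure_ratio(1.51)
print(f"p4/p1 = {ratio:.2f}, p4 = {ratio * 62.9:.0f} psi")
```

A hotter or lighter driver gas lowers `cv_t_ratio` and therefore the driver pressure required for the same shock strength.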

The stagnation temperature of the flow $T_{s2}$ is related to the 
stagnation pressure by 

$$\frac{T_{s2}}{T_1} = \left(\frac{p_{s2}}{p_1}\right)^{(\gamma-1)/\gamma} \tag{15}$$

This ratio is also a function of $p_2/p_1$, and the temperature rise 
$T_{s2} - T_1$ is plotted in Fig. 2 for $T_1 = 540$ deg R. Because the 
changes of state are isentropic, these temperature rises are smaller 
than any obtainable with mechanical compressors unless the 
compressed gas is cooled before being used. 
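
The temperature-rise curve can be reproduced in the same way. The following sketch assumes the isentropic relation between stagnation temperature and stagnation pressure referred to region 1, together with the weak-shock expressions for $M_2$ and $p_{s2}$; the function name is hypothetical.

```python
def stagnation_temperature_rise(p21, t1=540.0, gamma=1.4):
    """Return T_s2 - T1 in deg R for shock pressure ratio p21 = p2/p1."""
    n = (gamma - 1.0) / (2.0 * gamma)
    m2 = 2.0 / (gamma - 1.0) * (1.0 - p21 ** (-n))          # Eq. (10)
    ps_ratio = p21 * (1.0 + 0.5 * (gamma - 1.0) * m2 ** 2) \
        ** (gamma / (gamma - 1.0))                           # Eq. (12), p_s2/p1
    return t1 * (ps_ratio ** ((gamma - 1.0) / gamma) - 1.0)  # Eq. (15)


for p21 in (1.2, 1.5, 2.0, 2.5):
    print(f"p2/p1 = {p21:.1f}  rise = {stagnation_temperature_rise(p21):.0f} deg R")
```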

The foregoing discussion shows that a considerable range of 
flow rates and stagnation pressures can be produced in a shock 
tube. In general, the flow behind the incident shock wave will 
be modified by a reflected wave which depends on the particular 
test element. The actual flow must be determined by 
measurements as outlined in Section 2. 


4 Experimental Investigation of Orifice Flows 


To illustrate the techniques described in the preceding sections, 
a number of experiments were carried out to investigate the flow 
through orifices. A shock tube constructed of cold-drawn brass 
tubing was available for these experiments, and its dimensions 
are indicated in Fig. 1. One to three layers of photographic film 
served as diaphragms which could be ruptured by a solenoid- 
operated plunger. The expansion chamber either terminated at 
the orifice or could be extended 2 ft beyond it. 

Two flush-mounted condenser transducers (Rutishauser 
Model HR3 with electronic indicator ST-12) could be mounted at 
various distances upstream of the orifice plate, and the pressure 
at one of these locations was recorded by means of a cathode-ray 
oscilloscope. A calibration was obtained for each record by 
measuring the velocity of the incident shock wave w between the 
locations of the two transducers with a microsecond counter. 
The pressure rise produced by an incident shock wave then follows 
from the Rankine-Hugoniot shock relations as 

$$\frac{p_2 - p_1}{p_1} = \frac{2\gamma}{\gamma+1}\left[\left(\frac{w}{a_1}\right)^2 - 1\right]$$

Since the response of the transducers and the recording system
was linear over the required range, other pressures could be
evaluated readily from the records. All experiments were carried
out with initially atmospheric conditions so that $a_1$ was determined
by the room temperature, and $p_1$ was the barometric pressure.
Pressure records were obtained at points from 6 to 18 in.
from the orifice plate. The results were independent of this distance
within the accuracy of the measurements, thus justifying
the assumption of negligible effects of wall friction.
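
The calibration step described above can be sketched as follows: the measured shock velocity and the speed of sound at room temperature give the shock Mach number, and the Rankine-Hugoniot relation for a perfect gas then gives the pressure rise. The function names, the gas constant for air (about 1716.5 ft lbf per slug per deg R), and the value 1.4 for the ratio of specific heats are assumptions for this example.

```python
import math


def sound_speed_ft_s(temp_deg_r, gamma=1.4, gas_const=1716.5):
    """Speed of sound in ft/sec; gas_const is an assumed value for air."""
    return math.sqrt(gamma * gas_const * temp_deg_r)


def shock_pressure_ratio(shock_speed, sound_speed, gamma=1.4):
    """Pressure ratio across a normal shock moving into gas at rest."""
    mach = shock_speed / sound_speed
    return 1.0 + 2.0 * gamma / (gamma + 1.0) * (mach * mach - 1.0)


def shock_pressure_rise(shock_speed, sound_speed, p1, gamma=1.4):
    """Pressure rise across the shock, in the same units as p1."""
    return p1 * (shock_pressure_ratio(shock_speed, sound_speed, gamma) - 1.0)


if __name__ == "__main__":
    a1 = sound_speed_ft_s(540.0)  # room temperature, deg R
    w = 1.2 * a1                  # hypothetical measured shock speed
    print(round(a1, 1), round(shock_pressure_rise(w, a1, 14.7), 2))
```

Because the transducer response is linear, dividing this pressure rise by the recorded deflection of the incident shock gives the scale factor for every other pressure on the same record.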

Three orifice plates were used in the experiments and their design
is indicated in Table 1. The ratio of orifice diameter to duct
diameter is denoted by $\beta$, and the area ratio by m, so that
$m = \beta^2$ for single holes.

Four typical pressure records, obtained with orifice No. I, are
reproduced in Fig. 4 where the particular experimental conditions
are also noted. These records clearly show the pressure
rise, caused by the incident shock wave, and the steady state 
after wave reflection. The establishment of steady state is pre- 
ceded by a transient which is particularly noticeable if the orifice 
is located at the end of the duct. This transient was the subject 
of a previous study [4] and is disregarded here. Finally, the useful 
testing time is terminated when another wave, visible only in 
records (c) and (d), reaches the transducer. The steady-flow 
pressures can be evaluated from such records as indicated in the 
foregoing. 

Consider first the experiments in which the orifice was located
at the end of the duct. Because of the contraction of the outflowing
jet, the observed flow rate is smaller than it would have been for
an “ideal” orifice without a vena contracta. The ratio of the actual
to the ideal flow rate is equal to the ratio of the effective to the
actual orifice area and defines the coefficient of discharge C. The
flow conditions in the ideal orifice can be calculated since the
stagnation pressure and the stagnation temperature in the jet are
the same as in region 5 upstream. Furthermore, the static pressure
$p_j$ must be equal to the ambient pressure $p_1$ as long as the jet
is subsonic. The jet Mach number $M_j$ then follows from the relation
for the stagnation pressure

$$p_{05} = p_{0j} = p_j\left(1 + \frac{\gamma-1}{2}M_j^2\right)^{\gamma/(\gamma-1)} \qquad (17)$$

with $p_j = p_1$ provided that $M_j$ turns out to be less than 1.0.
Otherwise $M_j = 1.0$ and Equation (17) becomes a relation for
$p_j$. From the flow conditions in the jet, one can compute the
ideal flow rate and, thus, the coefficient of discharge.
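
The procedure just outlined can be sketched in a few lines. The code below is an illustration under stated assumptions, not the computation used for Table 2: it assumes a perfect gas with a ratio of specific heats of 1.4, works in SI units with the gas constant of air taken as 287.05 J/(kg K), and uses hypothetical function names.

```python
import math


def jet_state(p0, p_ambient, gamma=1.4):
    """Return (jet Mach number, jet static pressure) for the ideal orifice."""
    if p0 < p_ambient:
        raise ValueError("stagnation pressure must not be below ambient")
    critical = ((gamma + 1.0) / 2.0) ** (gamma / (gamma - 1.0))
    if p0 / p_ambient >= critical:
        # Sonic jet: the stagnation-pressure relation now fixes the jet pressure.
        return 1.0, p0 / critical
    m_sq = 2.0 / (gamma - 1.0) * ((p0 / p_ambient) ** ((gamma - 1.0) / gamma) - 1.0)
    return math.sqrt(m_sq), p_ambient


def ideal_mass_flow(p0, t0, p_ambient, area, gamma=1.4, gas_const=287.05):
    """Mass flow (kg/s) through the full orifice area without contraction."""
    m_j, p_j = jet_state(p0, p_ambient, gamma)
    t_j = t0 / (1.0 + 0.5 * (gamma - 1.0) * m_j ** 2)
    rho_j = p_j / (gas_const * t_j)
    u_j = m_j * math.sqrt(gamma * gas_const * t_j)
    return rho_j * u_j * area


def discharge_coefficient(measured_flow, p0, t0, p_ambient, area, gamma=1.4):
    """Ratio of the measured flow rate to the ideal flow rate."""
    return measured_flow / ideal_mass_flow(p0, t0, p_ambient, area, gamma)
```

The branch on the critical pressure ratio mirrors the two cases in the text: a subsonic jet at ambient pressure, or a sonic jet whose static pressure follows from the stagnation pressure.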

Calculations as outlined here were carried out for the three 
orifice plates described in Table 1 and for two strengths of the
incident shock wave. The results are collected in Table 2. 

Each line listed in the table represents the mean of several,
mostly five, experiments which, generally, agreed with each other
within about 1 per cent. A comparison of these results with
other data would be of interest, but no adequate air supply was
available to determine discharge coefficients in the conventional
manner, and no published data for the conditions used here could
be found. However, such a comparison seems possible for orifice
No. I which conforms to the ASME specifications for orifice
meters [12]. Since the discharge pressure at the vena contracta is
equal to $p_1$, the standard orifice meter with vena contracta taps
should be a close approximation to the present conditions. (A
similar assumption was successfully used by Cunningham [13, 14]
in a theory of jet contraction.) Neal [15] carried out such
experiments in a 5-in. pipe and compared his observations with
calculated data based on the ASME Fluid Meters Report [12]. His
results, interpolated for the size of orifice No. I are also included 
in Table 2. It is seen that the shock-tube results are in agree- 
ment with Neal’s data. Since discharge coefficients depend 
critically on details of the orifice design, no similar comparison 
can be made for the other two orifices which do not conform to any 
standard. As one would expect, their discharge coefficients are 
somewhat larger than those for orifice No. I. 

Since Neal’s experimental conditions were not exactly the same 
as those employed in the present study, some experiments were 
also carried out in which orifice No. I was followed by a 2-ft sec- 
tion of the shock tube. Incident shock waves of pressure ratios 
between about 1.3 and 2.2 were used, and the resulting flow rates 
are shown in Fig. 5. The corresponding losses, expressed as the
downstream to upstream stagnation pressure ratio $p_{06}/p_{05}$, are
also indicated in this figure. Each plotted point represents the


Table 1 Design of orifice plates*

No.   Design                                             β       m
I     Single hole with 45-deg bevel on downstream
      side, leaving a rim of 0.02 in.                    0.719   0.516
II    Single straight hole                               0.861   0.741
III   19 straight holes, uniformly distributed
      over duct area                                             0.516

* All holes are circular; thickness of the plates is 0.25 in.


Table 2. Results of experiments with orifice plates at the end of the shock 
tube 


Author Neal [15]* 


Ori- Pw Pas, G;, 

fice Pi Pi psi lb/sec exp. calc. 
26.3 1.74 0.76 0.79 0.76 
1.2, 1.24 17.4 0.84 0.68 0.70 0.65 

II 1.58 1.60 23.0 2.44 0.83 
1.22 1.12 16.1 1.15 0.82 

III 1.88 1.78 25.4 1.89 0.86 
1.22 1.18 16.9 0.94 0.83 


* Results for a 5-in. pipe and vena contracta taps. 
Fig. 4 Typical pressure records obtained with orifice No. I

Fig. 5 Flow rate and stagnation-pressure ratio for orifice No. I mounted
2 ft from end of shock tube


mean of four experiments which agreed with each other within 
about 1 per cent. Smooth curves are drawn through the experimental
points to demonstrate the smallness of scatter. It also
can be seen that the results are independent of the distance at 
which the pressure is recorded. 

It follows from the manner in which the flow conditions are
evaluated that the pressures $p_5$ and $p_6$ correspond to upstream
and downstream measurements at pipe taps of an orifice meter.
Since the flow rate is determined directly, it is possible to compute
the orifice coefficients and to compare them with ASME orifice
data for pipe taps [12]. In the present notation, the ASME relation
for a sharp-edged orifice is given by

$$G_5 = 0.525\,K\,Y\,D_o^2\left[\rho_5\,p_5\,(1 - r)\right]^{1/2} \qquad (18)$$

where $r = p_6/p_5$ is the pressure ratio across the orifice, $D_o$ is the
orifice diameter, K the flow coefficient for an incompressible fluid,
and Y the compressibility factor. In Equation (18), all quantities
are known from the experiments except the product KY
which can, thus, be computed. It can also be obtained directly 
from the ASME data for pipe taps. The orifice Reynolds number 
in these experiments is of the order of 5 X 10° which is in a range 
where K is practically independent of the Reynolds number. For 
orifice No. I, the ASME tables yield K = 1.015. The compressi- 
bility factor is given by the relation 


$$Y = 1 - \left[0.333 + 1.145\left(\beta^2 + 0.7\beta^5 + 12\beta^{13}\right)\right]\frac{1 - r}{\gamma} = 1 - 0.905\,(1 - r) \qquad (19)$$

for $\gamma = 1.4$ and orifice No. I.

The ASME values of KY and the experimental results are
plotted in Fig. 6. The experimental data agree fairly well with
the values predicted by the ASME relation with the exception of
the point corresponding to the lowest flow rate, for which the
measured coefficient is about 13 per cent too high. This point
should be omitted from the following discussion because, according
to Equation (18), an error in r of 1 per cent would account for
an error in KY of 10 per cent if the pressure ratio is as low as
$1 - r = 0.05$.

Fig. 6 Orifice coefficient for orifice No. I mounted 2 ft from end of shock
tube and comparison with ASME standard data
The ASME relation is well established for subcritical orifice 
flow, but Cunningham [13, 14] has shown that the compressibility 
factor is lower than predicted once the flow becomes supercritical. 
He found that the ASME relation for pipe taps ceases to be ac- 
curate for r < 0.77 and should then be replaced by 


$$Y = Y_{0.77} - \text{const}\,(0.77 - r) \qquad (20)$$

where $Y_{0.77}$ is given by Equation (19) for r = 0.77. The value of
the constant in Equation (20) varies somewhat with $\beta$. It is
0.364 for $\beta = 0.15$ so that the value of Y is about 2 per cent below
the ASME value at r = 0.6. Although Cunningham reports
extensively on various other taps, his published information on 
pipe taps is limited to the foregoing data. Extremely few investi- 
gations on supercritical flow through sharp-edged orifices have 
been published [16], and these unfortunately are either limited to 
small orifices with a low approach velocity, for instance [15], or do 
not include information on pipe taps. 
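
The size of Cunningham's correction can be illustrated with a small sketch that joins the ASME pipe-tap relation above r = 0.77 to the linear supercritical relation below it. The function names are hypothetical; the constant 0.364 and the diameter ratio 0.15 are the values quoted above, and a ratio of specific heats of 1.4 is assumed.

```python
def asme_slope(beta, gamma=1.4):
    """Coefficient of (1 - r) in the ASME pipe-tap compressibility factor."""
    return (0.333 + 1.145 * (beta ** 2 + 0.7 * beta ** 5 + 12.0 * beta ** 13)) / gamma


def compressibility_factor(r, beta, const, gamma=1.4, r_crit=0.77):
    """Y from the ASME relation for r >= r_crit, Cunningham's form below it."""
    slope = asme_slope(beta, gamma)
    if r >= r_crit:
        return 1.0 - slope * (1.0 - r)
    y_crit = 1.0 - slope * (1.0 - r_crit)
    return y_crit - const * (r_crit - r)


if __name__ == "__main__":
    beta, r = 0.15, 0.6
    y_asme = 1.0 - asme_slope(beta) * (1.0 - r)
    y_corr = compressibility_factor(r, beta, const=0.364)
    print(round(y_asme, 4), round(y_corr, 4), round(1.0 - y_corr / y_asme, 4))
```

Under these assumptions the corrected factor at r = 0.6 falls about 2 per cent below the extrapolated ASME value, in line with the figure given in the text.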

The line r = 0.77 which represents the transition from sub- 
critical to supercritical flow is included in Fig. 6 and in the 
neighborhood of this value the results of the shock tube ex- 
periments are within about 4 per cent of the ASME values. At 
higher values of 1 — r, the results show a distinct tendency to 
deviate more from the ASME line, and the two extreme points lie 
somewhat more than 7 per cent below. These results thus con- 
firm Cunningham’s observation, but the accuracy obtainable with 
the shock-tube method is, of course, not sufficient for precision 
calibrations of orifice meters. 


5 Discussion 


A shock-tube technique has been described by means of which 
the steady-flow losses caused by some short element in a duct can 
be determined. The flow is produced by an incident shock wave 
and its reflection from the test element. The initial conditions 
in the shock tube and the upstream pressures before and after 
passage of the reflected wave are sufficient for calculation of the 
flow conditions upstream and downstream of the test element, 
from which the losses can be derived. The establishment of 
steady flow through the test element is not instantaneous but in- 
volves a transient process as a result of which the reflected and 
transmitted waves are, in general, mixtures of shock and expansion
waves. Measurements of wave velocities are, therefore, not
reliable indications of the flow changes produced by the wave. 
The calculations are based on computing procedures of nonsteady 
flow for simple waves. In spite of the restriction to shock waves 
that are weak enough to cause only isentropic changes of state, 
the flow rates and stagnation pressures that can be produced 
cover a wide range. 

A shock tube represents a highly economical gas-supply system
since the amount of gas involved in each experiment is merely
the volume of the shock-tube expansion chamber filled to an
initially atmospheric or moderate pressure. It is conceivable that
these properties may make the shock tube sometimes preferable
to conventional compressor systems if gases other than air are to
be used.


To illustrate the method, discharge coefficients of several 
orifices have been determined. The experiments are well repro- 
ducible, and for a standard sharp-edged orifice the results agree 
within about 5 per cent with published data obtained in the con- 
ventional manner. Although available pressure transducers are 
not as accurate as precision manometers, the obtainable accuracy, 
as demonstrated by the experimental results, is adequate for 
many applications. 
Separation Prediction
for Conical Diffusers


A method of incompressible turbulent boundary-layer analysis based on the D parameter
introduced by D. Ross is applied to conical-diffuser flow. The separation conditions
are shown to depend on the initial momentum-thickness Reynolds number and a distance
parameter involving the initial momentum thickness, the initial radius, and the diffuser
length. Comparison with experimental information on diffuser separation indicates
that the predictions are reliable, but conservative.


The behavior of numerous fluid devices, including
diffusers, is adversely affected by the occurrence of separation.
If diffuser flow without separation can be achieved, then high 
efficiency is possible. However, it appears that the optimum dif- 
fuser performance occurs when separation impends near the end 
of the diffuser. The prediction of diffuser separation has been 
uncertain even though the study of incompressible-fluid flow in 
smooth diffusers has received the attention of numerous re- 
searchers for the last half century. It is the purpose of this study 
to present a method for predicting and analyzing conical-diffuser 
flow by relating the manner and the degree with which various 
parameters affect the separation. 

Separation implies the achievement of zero or near-zero mean 
velocities and shear stresses at or near the wall. It does not 
occur at any specific location or for any particular flow condition, 
but instead is associated with large turbulent velocity fluctuations 
near the wall. The amplitude of these fluctuations increases and 
temporary local flow reversals occur with increasing frequency as 
the flow proceeds into the adverse pressure gradient. Initially, 
therefore, separation is a transitory phenomenon as has been observed
visually by S. J. Kline [1]¹ and H. Tults [2] in plane
diffusers. Further progress of the flow into the adverse pressure 
gradient in a diffuser eventually leads to complete and permanent 
separation. Correlation of various degrees of separation with 
diffuser performance and optimum design in plane diffusers has 
been presented by Kline, Abbott, and Fox [3]. However, the 
available conical-diffuser flow studies have not included such 
precise determination of the degrees of separation, but refer in- 
stead to conditions which indicate the occurrence of either appreci- 
able regions of permanent-type separation or of such large transi- 
tory stall areas as are detectable with pitot-tube measurements. 

¹ Numbers in brackets designate References at end of paper.

² References indicated by date only are fully listed in References
[4] and [5].

This paper represents the culmination of a program of diffuser
research initiated in 1945 at the Pennsylvania State University
(Ordnance Research Laboratory) in connection with the design
of a large high-speed water tunnel [4]. A review of literature in
the field [5] indicated that most diffuser studies had been concerned
with plane or two-dimensional diffusers with relatively
thin entering boundary layers or conical diffusers with nearly
fully developed pipe flow at the entrance. The plane-diffuser
studies of K. Andres (1909),² H. Hochschild (1912), R. Kroner
(1920), F. Donch (1926), and J. Nikuradse (1929) in Germany,
and A. N. Vedernikoff (1926) in Russia were of uncertain value 
because the flows were probably not truly two-dimensional and 
were difficult to analyze because the initial boundary-layer con- 
ditions were poorly described. In the case of conical diffusers, 
the early work of A. H. Gibson (1910) involved entrance flows 
that were partly developed and gave valuable indications of the 
effect of angle. Little detail was obtained, however, on the nature 
of the flow in the diffuser. The 1921 study of G. E. Lyon pre- 
sented velocity profiles which indicated the development of the 
flow when the boundary layer at entrance was thin. The effect 
of the relative boundary-layer thickness at the entrance on dif- 
fuser efficiency was systematically studied by H. Peters (1931) in 
Germany. He found the pressure efficiency of a diffuser to be 
several per cent greater with a thin entering boundary layer than 
with fully developed flow (boundary-layer thickness equal to 
pipe or approach-duct radius). Studies of conical diffusers with 
fully developed flow at the entrance, such as reported by G. A. 
Gourzhienko (1939) in Russia and A. A. Kalinski (1944) in this 
country are valuable and suggestive, but not too pertinent to the 
present interest in partially developed entering flows. 

Two recent, if not current, programs of conical-diffuser re- 
search have involved relatively thin entrance boundary layers. 
At the National Aeronautics and Space Administration, rather 
large angle diffusers have been studied for entrance Mach num- 
bers of 0.2 to 0.9 and pipe Reynolds numbers of 10* and greater 
[6]. In some of these diffusers, separation occurred. In Great 
Britain, at the Mechanical Engineering Research Laboratory, 
F. A. L. Winternitz and W. J. Ramsay [7] have studied 5 and 
10-deg diffusers with an air-flow pipe-entrance Reynolds num- 
ber of 2.5 X 10%. The results were compared with the Penn 
State results [4] and indicated very strongly the importance of 
the initial relative momentum thickness. Annular screens were 
also used in these studies to produce different initial boundary- 
layer thicknesses and velocity-profile shapes. The diffuser be- 
havior was found to be independent of inlet-velocity-profile shape. 

Many conical-diffuser flows occur with a relatively thin 
boundary layer at the entrance. In such cases the flow is usually 
not fully developed at the end of the diffuser and the flow pat- 
tern is then as shown in Fig. 1. The wall contours are such that 
the cylindrical approach pipe or duct is joined to the conical dif- 
fuser by a gradual transition to avoid high localized pressure 
gradients. As shown in Fig. 1, the thin boundary layer at the 
entrance rapidly thickens, with marked change in the shape of 
the velocity profile, as the pressure rises and the velocity outside 
the layer decreases. In the center of the conduit an isoenergetic 
or potential core of fluid occurs which is affected by the bound- 
ary layer only through continuity relations. In this core one can 
simply relate the maximum velocity at any cross section with the 
pressure, just as one does in plane two-dimensional boundary-layer
flows. The relation between core velocity and pressure is
noted in Fig. 1, cf. Equation (10). The problem thus becomes
one of predicting the behavior of a boundary layer in an adverse
pressure gradient. It is further limited herein to the turbulent
incompressible-flow case adjacent to smooth walls.

Fig. 1 Boundary-layer development in a smooth conical diffuser


Early methods for predicting turbulent boundary-layer be- 
havior in adverse pressure gradients were based on the diffuser 
studies of Nikuradse and A. Buri (1931) or were developed by E. 
Gruschwitz (1931) based on his channel and airfoil studies. More 
recent methods of analysis have resulted from studies of boundary 
layers on the suction surface of airfoils. These methods have 
proved useful for special cases but do not appear to have uni- 
versal application. Attempts to apply them to conical-diffuser 
flow conditions have been disappointing and efforts to improve 
them resulted in an extensive series of diffuser-flow studies [4] 
and several empirical methods of predicting the boundary-layer 
behavior [4, 8]. 

Based on physical rationalization and consideration of a
careful selection of the experimental results of various researchers,
D. Ross [9] in 1953 developed a promising method of analysis for
plane two-dimensional turbulent boundary layers in adverse
pressure gradients. This led to predictions of the separation 
conditions for such flows by Robertson [10]. Recently Fraser 
[11] extended the Ross method to the case of axisymmetric-dif- 
fuser flow. Thus one should now be in a position to predict coni- 
cal-diffuser flows. 


Axisymmetric Boundary-Layer Analysis 


In accord with the currently accepted model, the turbulent
portion of a boundary layer is divided into two regions. Ross
adopted the Darcy-Prandtl relation for the velocity profile in the
outer or away-from-the-wall region,

$$\frac{U - u}{U} = D\left(1 - \frac{y}{\delta}\right)^{3/2} \qquad (1)$$

where u is the velocity at the wall distance y, U is the velocity
outside the boundary layer which has a thickness $\delta$, cf. Fig. 1,
and D is a parameter uniquely characteristic of the outer region.
For boundary layers in negligible pressure gradients

$$D = 8.4\left(c_f/2\right)^{1/2} \qquad (2)$$

where $c_f$ is the local skin-friction coefficient. The constant, 8.4,
in (2) has been found to apply reasonably well to the entrance
flow region of a pipe as well as to flat-plate boundary-layer flows.
As has been shown by various researchers in recent years, the friction
coefficient of smooth surfaces may be expressed as a unique
function of the momentum-thickness Reynolds number for the 
flow. Thus for zero or negligible-pressure-gradient boundary 
layers, (2) indicates that D decreases from 0.45 to 0.25 as the 
Reynolds number increases from 200 to 10°. 
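
The relation between D and the skin-friction coefficient for negligible pressure gradient is easily evaluated. The sketch below is illustrative only: the function names are hypothetical, the square-root form of the relation and the three-halves-power outer profile are taken as read from the garbled source, and no particular friction law is assumed, so the skin-friction coefficient is supplied directly.

```python
def ross_d_zero_gradient(c_f):
    """D parameter for negligible pressure gradient from the local skin friction."""
    return 8.4 * (c_f / 2.0) ** 0.5


def skin_friction_for_d(d):
    """Inverse relation: skin-friction coefficient that corresponds to a given D."""
    return 2.0 * (d / 8.4) ** 2


def outer_velocity_ratio(y_over_delta, d):
    """u/U in the outer region, assuming the three-halves-power profile."""
    return 1.0 - d * (1.0 - y_over_delta) ** 1.5


if __name__ == "__main__":
    # Skin-friction coefficients implied by the two limiting D values quoted above.
    for d in (0.45, 0.25):
        print(d, round(skin_friction_for_d(d), 5))
```

The quoted limits D = 0.45 and D = 0.25 thus correspond to skin-friction coefficients of about 0.0057 and 0.0018, respectively.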

For plane two-dimensional boundary-layer flows in adverse
pressure gradients, Ross [9] found that D increases to a value of
about 1.3 at separation. Similarly E. M. Uram [12], Robertson
and Holl [13], and Fraser [11] found essentially the same behavior
in conical-diffuser flow. The separation value $D_s = 1.3$ corresponds
roughly to the conditions for which temporal-mean-velocity
measurements (pitot tube) indicate close proximity to separation
of the permanent type. Reference to the remarks column
of Table 1 in regard to the first six tests and further reference to
the detailed information on the tests [4, 11, 12, 13] indicates the
difficulty of assessing even this criterion. In fact axisymmetric
conduits have an additional obstacle not found in truly plane
two-dimensional flows, in that “precise determination of the point of
separation ... presents much difficulty because of the appearance
of asymmetry in the flow pattern [6].” As noted in Table 1,
separation was found in one diffuser at D = 1.38 and near separation
at D = 1.30 and 1.35 in two other cases. These values
fall within Ross’s tolerance of 0.1 on $D_s = 1.3$. Therefore we assume
that 1.3 is the proper separation value for D.

The prediction of conical-diffuser separation is now reduced to 
the prediction of the increase in D as the flow proceeds. For 
distances of at least sixty times the initial boundary-layer thick- 
ness, Fraser found that the necessary relation is 


(5) 
pu (>) (3) 


where the subscript 0 indicates the value of the various factors at 
the entrance of the diffuser. In this equation R is the local 
radius of the diffuser cross-section and z is the axial distance from 
the diffuser origin, cf. Fig. 1. The factor A is given by the follow- 
ing approximation 


A & (ep)'/* — 0.038 (4) 


Equation (3) may be written in the following form 


6D A.Ro 6D/o| Us OR 
where $\theta$ is the boundary-layer momentum thickness,³ and $R_0$ is
the radius of the cross section at the diffuser entrance (i.e., the
radius of the approach pipe or duct).


The growth of the momentum thickness in conical-diffuser flow
is determined from the following relation developed by Ross and
Robertson [14]:

$$\frac{\theta_0 R_0}{\theta R} = \left(\frac{U}{U_0}\right)^{G} \qquad (5)$$

where G appears to be a unique function of the initial-momentum-thickness
Reynolds number $R_\theta$, cf. References [9, 11, 14]. The
necessary relation for predicting diffuser behavior is obtained by
combining Equations (3a) and (5) thus,


~ [az + Cas)” 


Letting $f = \delta/\theta D$, the velocity ratio for separation is


(3a) 


³ For axisymmetric flows the momentum thickness is given by the
identity,

$$\theta = \int_0^{\delta} \frac{u}{U}\left(1 - \frac{u}{U}\right)\left(1 - \frac{y}{R}\right) dy$$


(6) 


The subscript s is used to indicate conditions at separation. 

For plane two-dimensional boundary layers Ross [9] found a
unique empirical correlation between $\delta/\theta$ and D. Fraser [11]
succeeded in extending this to the axisymmetric-diffuser case. 
Combining the two relations, one can write 


f tps tp (7) 


The initial value $f_0$, for use in Equation (6), is found using a
value of D from (2) and an initial value of $\theta/R$. Determination
of the separation value $f_s$ requires knowledge of $(\theta/R)_s$, which is not
known. The variation in $\delta/\theta$ with D and $\theta/R$ is shown in Fig. 2,
whence $\theta/R$ is seen to be the major factor affecting $\delta/\theta$ throughout
most of the diffuser. An examination of the available experimental
information indicates that at or near separation $\theta/R$ may be
between 0.05 and 0.09. Using these $\theta/R$ values in (7) one finds
the corresponding values for $f_s$ are 5.7 and 6.3 when $D_s$ is taken
as 1.3. A value of 6.0 seems an appropriate tentative choice for $f_s$.


Use of Equation (6) to predict diffuser separation requires that
initial values of $\theta$, $R_\theta$, and R be specified. The terms A, G, and
$f_0$ are then determinate. The predictions will then appear in the
form of $U_s$ = function (z) indicating, for any length of diffuser,
the permissible reduction in core velocity for imminent separation.
Inspection of Equation (6) indicates that the separation performance
of conical diffusers should depend upon their length
$z/R_0$, and the flow conditions at entrance, $\theta_0/R_0$ and $R_\theta$. The
separate effects of these factors are not obvious since, for instance,
$f_0$ is a function of both $\theta_0/R_0$ and $R_\theta$.

Fig. 2 Ratio of disturbance to momentum thickness



Prediction of Separation 


The effects of various factors on separation have been evaluated 
through solution of Equation (6) for a broad range of basic 
parameters. The range of initial Reynolds numbers of from 200 
to 10° includes all turbulent-boundary-layer studies made to 
date. For fully developed pipe flow, $\theta/R$ varies from 0.05 to
0.067 (depending on Reynolds number) while few experimental-diffuser
results are available with initial values less than 0.01.
Since this study excludes fully developed flow, the primary calculations
were made for $\theta/R$ values of 0.01 and 0.03. A few calculations
were made for values of 0.001 and 0.06. The difference in
results between 0.001 and 0.01 was found to be small, while the
larger value (0.06) led to somewhat lower velocity ratios (in Fig.
4), but these results are probably not applicable because the diffuser
flow was most likely fully developed.

The calculations indicate that, for appreciable values of z, the
velocity ratio for separation $U_s/U_0$ decreases rapidly with $z^2/\theta_0 R_0$.
As would be expected, this predicts appreciable gains in diffuser
performance through gradual diffusion. The individual effects of
diffuser length and initial-relative-momentum thickness are, to a
close approximation, taken care of by the distance parameter,

$$L^2 = \frac{z^2}{\theta_0 R_0} \qquad (8)$$

The individual effects of $z/R_0$ and $\theta_0/R_0$ (factors of $L^2$) on the
separation-velocity ratio are depicted in Fig. 3 for one Reynolds
number.
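
The distance parameter is simple to evaluate, and a short sketch makes its two factors explicit. The function names are hypothetical. The sample rows are read from the garbled Table 1 on the assumption, made only for this example, that its three adjacent numerical columns are the initial relative momentum thickness, the ratio of axial distance to initial momentum thickness, and the distance parameter itself.

```python
def distance_parameter(z_over_r0, theta0_over_r0):
    """Distance parameter: z squared divided by (theta_0 times R_0)."""
    return z_over_r0 ** 2 / theta0_over_r0


def distance_parameter_from_table(z_over_theta0, theta0_over_r0):
    """Same parameter from z/theta_0, using z/R_0 = (z/theta_0)(theta_0/R_0)."""
    return z_over_theta0 ** 2 * theta0_over_r0


# (theta_0/R_0, z/theta_0, tabulated value) as read from Table 1; assumed columns.
TABLE_ROWS = [
    (0.010, 1288.0, 16600.0),
    (0.025, 377.0, 3540.0),
    (0.0025, 1700.0, 7200.0),
    (0.013, 330.0, 1400.0),
    (0.0028, 635.0, 1130.0),
]

if __name__ == "__main__":
    for theta_r, z_theta, tabulated in TABLE_ROWS:
        print(tabulated, round(distance_parameter_from_table(z_theta, theta_r)))
```

Under that reading the recomputed values agree with the tabulated ones to within about 1 per cent, which supports the form of the distance parameter used here.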

The variation in the separation-velocity ratio with initial-momentum-thickness
Reynolds number and the distance parameter
is shown in Fig. 4. The single curves represent both $\theta_0/R_0$
values (0.01 and 0.03) to within a third of a per cent. It is seen
that for values of $L^2$ below about 2000 the effect of an increase in
initial Reynolds number is beneficial. At very small values of $L^2$
the variation with Reynolds number approaches that found for the
Table 1 Experimental observations of conical-diffuser separation
a, 4 
Num- angle Rp, Area ( oO ) =z U Wwe 
ber deg in. ratio Fluid Roo R/> L? Us Ref. 
1 7 24.0 3.2¢ Water 8.0 < 10 0.010 1288 16600 0.49 15 _ 
{ 2 30 3.4 Air 0.69 0.027 13 
| Water 1.47 0.025 377 3540 059 4 
4 10 3.0 2.9 0.56 11 
| 7 5 3.75 4.0 7 2 
8 10 3.75 4.0 
| 9 12 5.0 2.0 1.2 0.019 209 
i 10 12 10.5 2.0 0.4 0.0025 1700 7200 6 od 
| 1l 12 10.5 2.0 1.8 0.013 330 1400 6 F 
| 12 23 10.5 1.9¢ 0.43 0.0028 635 1130 
* Area ratio at separation point.
| U 1 sia 
Ue 
Us Az* + fi © 
| 
| 
Fig. 3 Effects of distance and
separation

Fig. 4 Diffuser-core velocity ratio for separation


plane case [10], which corresponds to $\theta_0/R_0 = 0$ since the radius of
lateral curvature of a plane surface is infinite. At larger values of the
distance parameter ($L^2 > 2000$) the effect of an increase in initial
Reynolds number is seen to be unfavorable. In fact, very rough
extrapolation of the various curves suggests that they tend to
coalesce on a $U_s/U_0$ value of about 0.64 at a Reynolds number of
10°. The calculations have not been extended to such a high 
Reynolds number since it seems unlikely of achievement. The 
Reynolds-number effect shown in Fig. 4 is not large. Actually, a
tenfold change in $R_\theta$ results in a maximum change of only about
0.05 in $U_s/U_0$. Although the principal factor affecting separation
is $z/R_0$, as is clearly shown in Fig. 3, the initial relative boundary-layer
thickness is also seen to be significant.

In concluding this presentation of the predicted results, the 
limitations of the analysis should be considered. Unfortunately, 
these limitations are not clearly evident. Ross [9] in obtaining
the two-dimensional analog of Equation (3) noted that it was
limited to $x/\delta_0$ values of 50 or less. For axisymmetric flows Equation
(3) must have a similar limit, although it has worked in one
case for an $x/\delta_0$ value of 63 [11]. The velocity-momentum-thickness
relation of (5) is restricted to flows in which the adverse
pressure gradient predominates over skin-friction effects [14] and
this restriction also limits the distance over which the analysis
is applicable. Equation (5) and the present analysis are also
limited to diffuser flows with a potential core as depicted in Fig. 1.
For the entering flow $\theta/R$ must be somewhat less than 0.06 and
at exit it must be less than 0.10. One final restriction, which is
not apparent in any of the basic relations used, is that the 
analysis does not apply to relatively sudden expansions. In fact, 
just how far this method can be applied to large-angle diffusers is 
not clear. The nature of the restrictions noted indicates the need 
for additional research on conical-diffuser flow. 


Experimental Verification 


The diffuser research which has culminated in the present 
method of analysis included five flows in which separation or 
imminent separation were noted. Two of these used water as 
the test fluid [4] and three used air [11, 12, 13]. Values of the 
primary parameter D were evaluated from the air tests and, in 
fact, these are the basis for concluding that Ross’s critical or 
separation value of about 1.3 probably also applies in conical dif- 
fusers. In the water tests, separation or its proximity was noted 
in a more qualitative fashion. In addition to the five flows just 
described, one additional observation was obtained from the 
proofing tests of the large water tunnel [15] at Penn State. This 
tunnel has a 7-deg total-angle diffuser which expands from a 48 to
86-in. diam before the “first turn” of the tunnel. Initially the flow
at the 86-in. section was found to separate, but improvement of
the entering flow practically eliminated the separation [15].

In order to use the data from the six diffuser flows mentioned, 
it was first necessary to compute the momentum thickness
and $R_\theta$ at the diffuser entrance. Since no measurements were
made at this geometric point, some extrapolation of the preceding 
straight-conduit results was required. The performance results 
are presented as the first six points of Table 1. As noted in the 
remarks column, not all of these diffusers evidenced separation 
at the station where the last traverse was taken, but separa- 
tion was imminent. 

Six additional conical-diffuser flows, at or near separation, were 
obtained from the British [7] and NACA [6] programs. It was 
necessary to transform the information presented in these studies 
into the data needed for the analysis as outlined in this paper. 
Sometimes these transformations required approximations and 
estimates; for instance, the axisymmetric momentum thicknesses 
had to be estimated from the measured plane two-dimensional 
ones given. This was accomplished with the aid of the rough con- 
version relation [13] 

$$\frac{\theta_a}{\theta_p} = \frac{1}{1 + 1.37\,\delta^*/R} \qquad (9)$$


where the subscripts a and p indicate the axisymmetric and plane 
definitions of the boundary-layer terms and δ* is the displace- 
ment thickness. The velocity ratios were calculated from the 
pressure coefficients given in the reference papers, using the 
potential core relation 

$$\frac{U}{U_0} = (1 - C_p)^{1/2} \qquad (10)$$


For the NACA data the diffuser effectiveness, the pressure ef- 
ficiency η_p of Reference [4], was given and C_p was estimated from 
the equation 

$$C_p = \eta_p \left[ 1 - \left( \frac{R_0}{R} \right)^4 \right] \qquad (11)$$


The performance results of these tests are presented as the last six 
points of Table 1. 
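
The two pressure-velocity conversions used above can be sketched numerically. This is only an illustration: it assumes Equation (10) is the Bernoulli potential-core relation U/U_0 = (1 - C_p)^{1/2} and that Equation (11) scales the ideal pressure rise 1 - (R_0/R)^4 by the effectiveness η_p. The function names and the sample values 0.85 and 2.0 are hypothetical, not data from Table 1.

```python
import math


def core_velocity_ratio(cp: float) -> float:
    """U/U_0 from the pressure coefficient via the potential-core relation."""
    if cp > 1.0:
        raise ValueError("C_p cannot exceed 1 in a potential core")
    return math.sqrt(1.0 - cp)


def cp_from_effectiveness(eta_p: float, area_ratio: float) -> float:
    """C_p = eta_p * (1 - 1/AR^2), with AR = (R/R_0)^2 (assumed form of Eq. 11)."""
    return eta_p * (1.0 - 1.0 / area_ratio ** 2)


if __name__ == "__main__":
    # Hypothetical diffuser: effectiveness 0.85, area ratio 2.
    cp = cp_from_effectiveness(0.85, 2.0)
    print(f"C_p = {cp:.4f}, U/U_0 = {core_velocity_ratio(cp):.4f}")
```

With these placeholder inputs the ideal pressure rise is 0.75, so C_p is 0.6375 and the core velocity ratio is close to 0.60.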

Comparison of the values of U/U_0, found at the separation sta- 
tion or at the end of the several diffusers, with the predictions of 
Equation (6), indicates that in half the cases the agreement is 
within 0.02. In four cases the measured values are 0.04 below the 
prediction and in the other two cases 0.06 and 0.08 below. Ex- 
cept for these last two cases, the agreement seems good but con- 
servative. It is quite possible that in some cases the flow may 
have passed the separation point before the measurement station 
was reached. 

Graphical depiction of the experimental verification is difficult 
because of the many significant variables; however, by grouping 
eleven of the points into three Reynolds-number ranges, Fig. 5 
has been prepared. Point 2 is the only one that was overpre- 
dicted, but as noted in Table 1, the flow had not quite separated. 
Point 12 from the 23-deg diffuser of Little and Wilbur [6] shows 
the largest discrepancy and no explanation has been found for the 
low velocity ratio—it could be the result of a misinterpretation of 
the basic data. Point 1 (the water-tunnel point) is not presented 
in Fig. 5 but diverges by about 0.06 from prediction. As men- 
tioned previously, the flow in this diffuser was erratic and difficult. 
It would appear from the present analysis that the water-tunnel 
diffuser was designed too close to separation conditions. 


General Comments 


It is concluded that the method of analysis, as represented by 
Equation (6) and depicted in Fig. 4, is suitable for the prediction 
of turbulent-boundary-layer separation in conical diffusers and 
that the initial-momentum-thickness Reynolds number R_θ and 
the distance parameter L* are significant factors in predicting 
separation. For the larger values of L*, increase in R_θ has an 
adverse effect leading to earlier separation. 

The predictions of the proposed analysis have been compared 
with the available experimental results on conical diffusers. The 
next step is to compare the results with the conclusions reached by 
other researchers. All three of the research programs referred to 
[4, 6, 7] concluded that diffuser performance decreased markedly 
with increase in the initial relative boundary-layer thickness 
and with increase in angle of diffusion. The observed decrease in 
diffuser performance caused by increase in the initial relative 
boundary-layer thickness is in agreement with the present pre- 
dictions as exemplified by Fig. 3. The angle effect is also verified 
because an increase in angle requires a decrease in length for a 
given area ratio. 

The results of diffuser studies are often presented in terms of the 
area ratio (R/R_0)^2 since they are concerned with diffusers in which 
the length and angle are fixed. This area ratio is intimately related 
to the pressure coefficient and the velocity ratio as shown by Equa- 
tions (10) and (11). It should be noted that the diffuser effective- 
ness or pressure efficiency η_p in (11) varies only slightly with the 
diffuser length [4] but quite considerably between diffusers. The 
relation between the velocity or C_p and the area ratio can be more 
appropriately expressed using continuity relationships as indi- 
cated in Appendix A. The ratio, which expresses the amount of 
diffusion (reduction in mean velocity), is related to initial radius, 
angle α (total diffuser angle), and length z in conical diffusers by 
the relation 

$$\left( \frac{R}{R_0} \right)^2 = \left( 1 + \frac{z}{R_0} \tan \frac{\alpha}{2} \right)^2 \qquad (12)$$


Fig. 5 Experimental verification of separation predictions (encircled 
numbers refer to Table 1) 


For a given area ratio, Equation (12) defines a whole family of 
diffusers of various angles and lengths. This paper is not pri- 
marily concerned with geometric relationships, but with the ef- 
fects of the various parameters on the flow separation. These 
parameters are presented so that their individual effects are evi- 
dent, cf. Figs. 3, 4, and 5. 
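
The family of diffusers defined by a fixed area ratio can be illustrated with a short sketch. It assumes Equation (12) expresses the cone geometry R = R_0 + z tan(α/2), with α the total angle; the function names are illustrative only. The sample length z/R_0 = 10.95 is the value reached in the Appendix B example.

```python
import math


def area_ratio(z_over_r0: float, total_angle_deg: float) -> float:
    """(R/R_0)^2 for a cone of total angle alpha and length z (assumed Eq. 12)."""
    half_angle = math.radians(total_angle_deg) / 2.0
    return (1.0 + z_over_r0 * math.tan(half_angle)) ** 2


def total_angle_deg(target_area_ratio: float, z_over_r0: float) -> float:
    """Total cone angle that yields the target area ratio over a given length."""
    radius_growth = math.sqrt(target_area_ratio) - 1.0
    return 2.0 * math.degrees(math.atan(radius_growth / z_over_r0))


if __name__ == "__main__":
    # Area ratio 3 over z/R_0 = 10.95, as in the Appendix B example.
    alpha = total_angle_deg(3.0, 10.95)
    print(f"total angle = {alpha:.2f} deg")
    # Any other (angle, length) pair on the same curve gives the same area ratio.
    print(f"check area ratio = {area_ratio(10.95, alpha):.3f}")
```

Under this reading, the example diffuser of area ratio 3 and z/R_0 = 10.95 has a total angle of roughly 7.6 deg, inside the 10-deg range of the underlying studies.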

For the design of diffusers one would be more interested in the 
pipe Reynolds number than in R_θ. This more convenient 
parameter can be computed easily, however, since the pipe 
Reynolds number may be expressed as (2R_0/θ_0)R_θ. This rela- 
tion again illustrates the importance of θ_0/R_0 in predicting separa- 
tion in conical diffusers. Therefore, the extension of the methods 
proposed here to the design problem appears fairly straightfor- 
ward and is indicated in Appendix B. One wonders, however, 
how valid this method of prediction is when applied to relatively 
short wide-angle expansions since the studies on which this 
analysis is based were restricted to angles of 10 deg or less. Com- 
parison has been made to 12 and 23-deg diffusers, cf. Table 1, but 
only in terms of the ratio between the final and initial core veloci- 
ties. An extrapolation based on the method described in this 
paper would indicate that incredible velocity ratios of 0.7 to 0.9 
(depending on the Reynolds number) are possible without separa- 
tion in sudden expansions (α = 180 deg). This is an impossible 
condition; therefore, more detailed experimental information on 
conical-diffuser flow (especially for wide angles) is needed to verify 
the conclusions of this analysis. 


APPENDIX A 
Determination of C_p 


The pressure variation for design problems is not known and 
must be calculated from the continuity relationship using δ*, the 
displacement thickness, 


Fig. 6 Variation in shape parameter for conical-diffuser flow 


For any given problem the velocity profile [i.e., u/U = func- 
tion (y/δ)] is completely defined [11] and this identity may be 
evaluated numerically to determine δ*. Numerical methods 
are tedious and time-consuming; therefore, δ* can be more 
quickly approximated from the rough empirical correlation of 
Fig. 6 which shows the variation of H = δ*/θ with D. 

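The relationship between C_p and the area ratio can now be 
found by solving for U in the continuity relationship 

$$U R^2 \left( 1 - \frac{2\delta^*}{R} \right) = U_0 R_0^2 \left( 1 - \frac{2\delta_0^*}{R_0} \right)$$

and substituting in Equation (10) to give 

$$(1 - C_p)^{1/2} = \frac{U}{U_0} = \frac{(R_0 - 2\delta_0^*)\,R_0}{(R - 2\delta^*)\,R} \qquad (13)$$


APPENDIX B 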


Diffuser Design Method 

Although this paper is not primarily concerned with design 
methods, it appears desirable to extend the analysis to de- 
sign problems. The design approach for a conical diffuser of area 
ratio (R/R_0)^2 = 3 is outlined hereafter. The diffuser-entrance 
conditions are R_θ = 10^4 and θ_0/R_0 = 0.02. 

a Choose a trial value of U_s/U_0; this choice is facilitated by 
reference to Figs. 3 or 4. In the example try 0.6. 

b Verify the suitability of the trial value. 

(1) Calculate θ from Equation (5). In the example 

$$\frac{\theta_s}{\theta_0} = \frac{1}{\sqrt{3}} \left( \frac{1}{0.6} \right)^{4.45} = 5.61$$


(2) Estimate the displacement thickness δ* using the empirical 
correlation of Fig. 6 which when D_s = 1.3 gives H_s = 3.1. In the 
example 


(3) Calculate U_s/U_0 from Equation (13). This requires the 
initial value of δ* which may be found from H_0 given in the upper 
plot of Fig. 6. In the example (R_θ = 10^4) 

$$\frac{\delta_0^*}{R_0} = H_0 \frac{\theta_0}{R_0} = 1.31 \times 0.02 = 0.026$$

and for the velocity ratio 

$$\frac{U_s}{U_0} = \frac{1 - 2 \times 0.026}{3(1 - 2 \times 0.20)} = 0.54$$


c By trial-and-error methods the initial and calculated values 
of U_s/U_0 are brought into agreement. In the example, a final 
value of U_s/U_0 = 0.586 is found. The solution is very sensitive to 
the velocity ratio. Therefore, the last few values of δ*/R should 
probably be determined from the velocity-profile relations rather 
than by approximation from Fig. 6. 

d The diffuser length and angle are now defined by Equations 
(6) and (12). Thus in the example, using the known value of 
U,/U> = 0.586 in Fig. 4 the length parameter L* is about 6.0 X 
10%. Therefore ; 


z 
— = 6.0 X 10? X 0.02 = 10.95 
Ro 


e As a final check, it is necessary to make certain that the re- 
sults lie within the two known limitations of the method. First, 
the diffuser length should not exceed about 60 times the initial 
boundary-layer thickness δ_0, which is found from Equation (7) or 
Fig. 2. In the example δ_0/R_0 = 0.25, whence 

$$\frac{z}{\delta_0} = \frac{z/R_0}{\delta_0/R_0} = \frac{10.95}{0.25} = 43.8$$

Second, θ/R at the end of the diffuser should not exceed the value 
for fully developed flow which is about 0.10. In the example using 
U_s/U_0 = 0.586 the final momentum-thickness ratio is 


θ/R = 0.071 


Therefore the example solved lies within the known limitations of 
the method. 
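
The trial-and-error loop of steps a to c can be sketched as a small root search. Several pieces are assumptions rather than the authors' stated procedure: the momentum-thickness growth is taken as θ_s/θ_0 = (R_0/R)(U_0/U_s)^4.45, a form inferred only from the printed example value 5.61 (Equation (5) itself is not reproduced in this section); H is held fixed at the Fig. 6 readings 1.31 and 3.1; and the continuity relation is used in its linearized form. All names are hypothetical.

```python
import math

AREA_RATIO = 3.0        # (R/R_0)^2 from the example
THETA0_OVER_R0 = 0.02   # entrance momentum thickness over entrance radius
H_INITIAL = 1.31        # entrance shape parameter read from Fig. 6
H_SEPARATION = 3.1      # shape parameter at D = 1.3 read from Fig. 6
EXPONENT = 4.45         # ASSUMED: inferred from the printed value 5.61


def theta_growth(u_ratio: float) -> float:
    """theta_s/theta_0 for a trial U_s/U_0 (assumed power-law form)."""
    return u_ratio ** (-EXPONENT) / math.sqrt(AREA_RATIO)


def calculated_velocity_ratio(u_trial: float) -> float:
    """U_s/U_0 returned by linearized continuity for a trial value."""
    radius_ratio = math.sqrt(AREA_RATIO)
    theta_over_r = theta_growth(u_trial) * THETA0_OVER_R0 / radius_ratio
    dstar_exit = H_SEPARATION * theta_over_r     # delta*/R at separation
    dstar_entry = H_INITIAL * THETA0_OVER_R0     # delta_0*/R_0 at entrance
    return (1.0 - 2.0 * dstar_entry) / (AREA_RATIO * (1.0 - 2.0 * dstar_exit))


def solve_velocity_ratio(lo: float = 0.55, hi: float = 0.65,
                         tol: float = 1e-7) -> float:
    """Bisection on calculated(u) - u, replacing the manual trial and error."""
    def mismatch(u: float) -> float:
        return calculated_velocity_ratio(u) - u

    if mismatch(lo) * mismatch(hi) > 0.0:
        raise ValueError("bracket does not contain a solution")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mismatch(lo) * mismatch(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    print(f"theta_s/theta_0 at trial 0.6: {theta_growth(0.6):.2f}")
    print(f"calculated U_s/U_0 at trial 0.6: {calculated_velocity_ratio(0.6):.3f}")
    print(f"converged U_s/U_0: {solve_velocity_ratio():.3f}")
```

Under these assumptions the first pass at 0.6 returns about 0.53 and the iteration settles near 0.58, slightly below the 0.586 quoted in the text. That gap is in line with the remark that the result is very sensitive and that the last few values of δ*/R are better taken from the velocity-profile relations than from Fig. 6.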


DISCUSSION 
Virgil A. Sandborn⁴ 


The method outlined for predicting separation in conical dif- 
fusers appears to yield quite accurate results and, as such, the 
authors are to be congratulated on a useful engineering solution of 
the problem. One may suspect that the use of a one-parameter 
type of analysis for turbulent separation, such as employed in 
the subject paper, will be too restricted for the wide number of 
flows possible. The restriction of the parameters in the present 
solution can not be considered a fair criticism of the paper, since 
no less restricted analysis is available. However, it should not 
be out of place here to point out a possible inconsistency in the 
one-parameter assumption, in hopes that it may stimulate think- 
ing toward further analysis of this difficult problem. 

The Darcy-Prandtl relation for the velocity profile in the outer 
region of the boundary layer, 

$$1 - u/U = D(1 - y/\delta)^{3/2} \qquad (14)$$

is a very reasonable form for the turbulent layer. The present 
reviewer has studied this type of equation in a general form in 
which the power 1.5 is replaced by an arbitrary power m. Equa- 
tion (14) is, of course, made more complex by the inclusion of an 
arbitrary power, but the increased generality is striking, for when 
we set D = 1 and m = 1 the equation becomes the exact solution 
for the special case of Couette flow and with D = 1 and m = 2 
the exact solution for Poiseuille flow is obtained. Clauser⁵ has 
shown quite conclusively that the outer region of turbulent 
boundary layers behaves exactly as a laminar boundary layer. 
Thus, one might well expect the simple cases (of Poiseuille and 
Couette flow) to appear in any general relation for the outer re- 
gion of the turbulent boundary layer. 

⁴ Aeronautical Research Engineer, National Aeronautics and 
Space Administration, Lewis Research Center, Cleveland, Ohio. 

⁵ F. H. Clauser, “The Turbulent Boundary Layer,” vol. IV of 
Advances in Applied Mechanics, Academic Press, Inc., 1956, pp. 1-51. 

If one considers equation (14) and assumes that the integral 
parameters, momentum and displacement thickness, depend al- 
most entirely on the outer region, then we may evaluate them in 
terms of D. (The assumption that the integral parameters de- 
pend on the outer region of the layer is fully equivalent to the 
assumption of similarity of the outer region made by Clauser.) 
From their definition 


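$$\delta^*/\delta = \int_0^1 (1 - u/U)\,dy/\delta = \frac{D}{2.5} \qquad (15)$$

$$\theta/\delta = \int_0^1 (u/U)(1 - u/U)\,dy/\delta = \frac{D}{2.5} - \frac{D^2}{4} \qquad (16)$$

Thus, 

$$\theta/\delta = \delta^*/\delta - 1.56(\delta^*/\delta)^2 \qquad (17)$$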


Clauser finds two constants for each set of equilibrium flows he 
studied, and if these constants are related to δ* and θ the follow- 
ing equation is obtained 

$$\theta/\delta = \delta^*/\delta - \frac{G}{\Delta/\delta}(\delta^*/\delta)^2 \qquad (18)$$

where G and Δ/δ are the two constants. Equation (17) shows 
that the profile represented by equation (14) is just one of the 
family of possible profiles studied by Clauser. 

If D is set equal to 1.3, as it is in the subject paper at separa- 
tion, then equations (15) and (16) give the following value for pro- 
file form factor 

$$\delta^*/\theta = 5.3 \qquad (19)$$

In physical measurements the value of δ*/θ does not exceed 3. 
(If instead of a value of 1.3 a value of 1 were used for D, then a 
value of approximately 2.7 is obtained which is more consistent 
with observations.) The subject paper obtains D quite dif- 
ferently from that suggested here. Thus, the discrepancy sug- 
gested by relation (19) does not appear in the final results. For 
further work on the problem it would be desirable to bring such 
results as equation (19) into a more reasonable line with experi- 
mental results. 
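
The form factors quoted in this discussion follow directly from integrating the outer law over the whole layer, which a few lines can confirm. The sketch below uses the closed-form integrals of 1 - u/U = D(1 - y/δ)^m; the general exponent m is included only as an illustration of the generalized form mentioned above, and the function name is hypothetical.

```python
def outer_law_shape_factor(d_param: float, m: float = 1.5) -> float:
    """H = delta*/theta when the outer law is used across the whole layer.

    delta*/delta = D/(m + 1) and theta/delta = D/(m + 1) - D^2/(2m + 1),
    which reduce to D/2.5 and D/2.5 - D^2/4 for m = 1.5.
    """
    dstar = d_param / (m + 1.0)
    theta = dstar - d_param ** 2 / (2.0 * m + 1.0)
    if theta <= 0.0:
        raise ValueError("momentum thickness is not positive for this D and m")
    return dstar / theta


if __name__ == "__main__":
    for d_value in (1.0, 1.3):
        print(f"D = {d_value}: H = {outer_law_shape_factor(d_value):.2f}")
```

For m = 1.5 this gives H of about 5.3 at D = 1.3 and about 2.7 at D = 1, matching the two values cited in the discussion.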


Earl M. Uram⁶ 


The understanding of the phenomenon of boundary layer sepa- 
ration and its prediction is indeed a perplexing problem confront- 
ing fluid dynamicists and engineers. Until the physics of the 
situation is understood, empirical methods must be resorted to in 
order to obtain immediately useful results. According to the 
evidence presented in Fig. 5, the present paper offers a most 
commendable example of such an endeavor applicable within the 
limitations clearly described by the authors. 

⁶ United Aircraft Corporation, Research Department, East Hart- 
ford, Conn. 

One of the limitations not explicitly pointed out in the paper is 
the basic assumption that the occurrence of separation is asso- 
ciated with a particular value of an effective shape parameter of 
the velocity profile. This criterion must be used with caution as 
evidenced by many previous attempts which, although not proved 
incorrect, have been rendered ineffectual by the introduction of 
new data. Since the parameter D is shown to be correlated with 
the shape parameter H in a rather unique sense for both two-di- 
mensional and axisymmetric conical flows, it is apparent that the 
value of the shape parameter H is mainly dictated by the outer 
region of the boundary layer. The use of D = 1.3, which is com- 
pletely justified by the deductions of the authors from the 
representative conical diffuser flows, effectively says that on the 
basis of present evidence we have revised our opinions that bound- 
ary layers will separate when their values of H achieve a value of 
about 3 instead of 1.8 as was used several years ago. It is well 
known that separation has been encountered for values of H 
anywhere between the values of 1.8 and 3.0, and it would seem 
that axisymmetric flows tend toward the higher values because of 
the extra degree of freedom resulting from the axial symmetry of 
the flow. It is difficult to suppress the feeling that it is likely that 
new data will inevitably not conform to the shape parameter 
criterion, since this criterion does not take into account or de- 
scribe the basic physics involved in the separation phenomenon. 

Regardless of the shape parameter value chosen to depict 
separation, the structure of the analysis built upon this criterion 
and the deductions of the authors are still seen to be valid. One 
of the most interesting deductions is that arising from Fig. 4. 
The authors point out that it is possible, under certain conditions 
of the length parameter L*, for an increase in Reynolds number of 
the flow to have adverse effects upon the separation characteris- 
tics of the boundary layer. Until now it has generally been felt, 
without the aid of a formal analysis as presented by the authors, 
that an increase in Reynolds number would always tend to de- 
crease the possibility of separation; e.g., an increase in Reynolds 
number (usually accomplished by increasing the velocity) is al- 
ways accompanied by a decrease in initial momentum thickness 
which, in accordance with the authors’ findings, would forestall 
the occurrence of separation to a later point. It is apparent, how- 
ever, that at some point the z/R_0 effects tend to counteract the 
beneficial effects of the reduction in initial momentum thickness. 
One could not be aware of this fact on an intuitive basis. It is 
of interest to mention here that results and analysis of tests of 
annular diffusers at the United Aircraft Corporation Research 
Department are in complete accord with the deductions of the 
analysis presented by the authors. 

As expressed or implied by other investigators, boundary-layer 
separation seems to be grossly a function of only the initial mo- 
mentum distribution or momentum loss and, as pointed out in the 
present paper, to be a function of the length of pressure rise, but 
has no dependence upon the local pressure gradients or the charac- 
ter of the pressure distribution within the diffuser. It seems that 
any physically correct analysis for describing the occurrence of 
separation would have to take these latter factors into account. 
It has been the experience of this writer that previous analyses 
based upon a characteristic value of shape parameter as indicat- 
ing a separated velocity profile work only for those pressure dis- 
tributions in which the local nondimensional pressure-gradient- 
per-unit-length, (1/U_1) dU_1/dz, is a monotonically decreasing 
function in the flow direction. Such prediction methods will 
always forecast the occurrence of separation. However, there are 
many cases of pressure distributions in which the nondimensional 
pressure gradient per unit length decreases and then increases 
again (the pressure distribution has an inflection point) and for 
which separation never occurs. A case in point in which this type 
of separation prediction criterion would break down would be 
that for an equilibrium boundary layer generated in an adverse 
pressure distribution. A very recent analysis by B. S. Stratford⁷ 
arrives at a separation criterion which involves the local pressure 
gradient at separation and attempts to offer a correction which 
takes into account the second derivative of the pressure distribu- 
tion or the inflection characteristics of the pressure rise curve. 

In conclusion, this writer takes pleasure in commending the 
authors for their most useful contribution to a very perplexing 
problem. 

⁷ B. S. Stratford, “The Prediction of Separation of the Turbulent 
Boundary Layer,” Journal of Fluid Mechanics, vol. 5, part 1, January, 
1959. 


Stafford W. Wilbur⁸ 


The authors have presented an empirical method for the pre- 
diction of separation in conical diffusers that should be of value 
both in the design and in the analysis of many basic flow systems. 
Since the state of the knowledge of boundary layer growth in re- 
gions of adverse pressure gradients does not approach that re- 
quired to derive diffuser design information, the prediction of 
diffuser flow characteristics must necessarily be based on em- 
pirical methods. The attainment of a satisfactory method for 
predicting separation in such systems is an important step in the 
understanding of this basic problem. 

The authors’ method for the prediction of separation in conical 
diffusers shows remarkably good agreement with the experimental 
results of other researchers. This agreement depends upon 
several assumptions and approximations that were necessary in 
the transformation of the limited experimental data to a form 
suitable for comparison with the present method of analysis. As 
noted by the authors, additional experimental data are needed to 
verify this method, while the subsequent determination of 
parametric limits will be necessary to define the limits of ap- 
plicability. 


Authors’ Closure 


Two of the discussers have raised questions about the velocity 
profile near separation and the shape parameter H. They are 
concerned with the correlation between the outer flow parameter 
D and H, since Fig. 6 suggests that this is unique. If it were, the 
separation value of D = 1.3 would imply a specific value of H, 
but as these discussers note this is hardly to be expected. In 
preparing the paper the authors were also disturbed by the ap- 
parent simplicity of the result implied by the data shown in Fig. 6. 
It was for this reason that the effect of Reynolds number (or c_f) 
was indicated for the plane case. In the axisymmetric case 
another parameter enters in to modify the H-D relation, namely 
the relative boundary-layer thickness. It was not possible to 
separate the points of Fig. 6 in regard to these two variables and 
the labor involved in calculating the family of curves which must 
describe the situation was considerable. However, as noted in 
Appendix B, the final refinements of diffuser design probably re- 
quire specific velocity profile calculation. 

Mr. Sandborn uses the outer law of Equation (1) or (14) to 
evaluate H in flows approaching separation. This is incorrect 
for D values above unity where the relation indicates negative 
velocities near the wall. It is thus not surprising that for D = 1.3 
the unbelievably large value of H indicated by Equation (19) is 
obtained. If the relation is integrated from the point (y/δ = 
0.161 for D = 1.3) where zero velocity is indicated a more reasona- 
ble H value results. Thus for D = 1.3 an H value half that 
of Equation (19) is found. This value is somewhat low, as is 
apparent from Fig. 6, but is much more reasonable. The details 
of the exact form of the velocity profile at small y/δ values are 
important in the integral definitions of Equations (15) and (16). 
Use of the outer law only is rather misleading, even for plane 
flows. For the corresponding axisymmetric (inside conduit) 
flows the integral relations for δ* and θ (as given in the paper) lead 
to even greater dependence on the wall region of the profile. 
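
The effect of starting the integration at the zero-velocity point can be checked with a short numerical sketch. It assumes the plane definitions of Equations (15) and (16) with the lower limit moved from the wall to the point where the outer law gives u = 0, and it ignores the region below that point entirely; the function name and the midpoint-rule quadrature are illustrative choices.

```python
def truncated_shape_factor(d_param: float, m: float = 1.5, steps: int = 20000):
    """Integrate the outer law only where it predicts u >= 0.

    Returns (eta_zero, H), where eta_zero is the y/delta at which
    1 - u/U = D * (1 - y/delta)**m gives zero velocity (requires D > 1).
    """
    if d_param <= 1.0:
        raise ValueError("the outer law has no zero-velocity point for D <= 1")
    eta_zero = 1.0 - d_param ** (-1.0 / m)
    width = (1.0 - eta_zero) / steps
    dstar = 0.0
    theta = 0.0
    for i in range(steps):
        eta = eta_zero + (i + 0.5) * width          # midpoint rule
        defect = d_param * (1.0 - eta) ** m         # 1 - u/U
        dstar += defect * width
        theta += (1.0 - defect) * defect * width
    return eta_zero, dstar / theta


if __name__ == "__main__":
    eta_zero, shape = truncated_shape_factor(1.3)
    print(f"zero velocity at y/delta = {eta_zero:.3f}, H = {shape:.2f}")
```

For D = 1.3 the zero-velocity point falls at y/δ of about 0.16 and the truncated integrals give H close to 2.67, half of the 5.3 obtained from the untruncated outer law.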

These comments on the velocity profile are clarified through 
reference to actual plots of the profiles. Since none were pre- 
sented in the paper, Fig. 7 has been prepared showing several 
profiles calculated for D values ranging from that for negligible 
pressure gradient (0.3) to that for separation (1.3). The calcula- 
tions are made at one Reynolds number; this is an artificiality 
since in an actual boundary layer the Reynolds number would 
increase as the flow progressed. The Reynolds number enters 
into the specification of the profile through the skin-friction 
coefficient in the wall-law portion of the profile; c_f or R_θ is really 
a second parameter involved in specifying the profile. The 
wall-law relation used [9] is 

$$\frac{u}{u_\tau} = 5.6\left(1 + \log \frac{y u_\tau}{\nu}\right) \qquad (20)$$

where u_τ = U√(c_f/2). Between the outer limit of this law 
(at y/δ of 0.1 to 0.15) and the inner limit of the outer law (at 
about y/δ = 0.25) the profile must be faired by eye, or blended 
as Ross [9] termed it. This fairing is shown by the dashed por- 
tions of the profiles in Fig. 7 and is not difficult to perform except 
for the separation profile. At separation the skin friction⁹ is 
zero and the velocity given by the wall-law is zero. However, 
just before separation this is not so and some indication of the 
profile in the faired region can be inferred from the preceding 
profiles. Thus the profile for D = 1.2 has been used to estimate 
the profile shown for D = 1.3. The error inherent in the use of 
Equation (1) or (14) to describe the complete velocity profile is 
apparent from Fig. 7, where the extended outer-law profiles for 
several D values are shown by the dot-dash lines. 

⁸ National Aeronautics and Space Administration, Washington, 

Fig. 7 Typical velocity profiles for range in outer parameter D showing 
wall and outer laws (dashed portion indicates faired profile curves be- 
tween two laws) 

The authors concur with Mr. Uram that the details of the 
flow conditions immediately preceding separation must exert 
some influence on separation. As he notes, it is just such details 
which are glossed over by the analysis method used. Unfor- 
tunately, as is apparent from Table 1, too little data on separation 
are available to assess the local effects. If the experimental 
agreement shown in the paper is not just fortuitous, it would 
appear that the principal factors have been included in the pre- 
diction of separation. Mr. Uram questions the applicability of 
the analysis to flows in which the pressure gradient does not 
simply increase with distance, as in a conical diffuser. In one of 
the studies [13] which led to this paper, the boundary layer was 
studied in a diffuser followed by a straight conduit. This did not 
produce an inflection in the pressure curve—a contraction is 
required for this—but a considerable change in flow conditions 
was achieved. The boundary layer approached close to separa- 
tion (D = 1.1 and H = 2.6) and then receded from it (to D = 
0.9 and H = 2.2). Since the D variation with distance in this 
case agrees with the analysis, i.e., Equation (3), no separation 
would be predicted and none was found. Since the pressure 
variations for equilibrium boundary layers are even smoother than 
in the case just discussed, it is hard to see why an equilibrium 
boundary layer would lead to any more divergence between the 
present type of analysis and experiment. 

The authors greatly appreciate the discussions of Messrs. 
Sandborn, Uram, and Wilbur and thank them for their comments 
about the hoped-for applicability of the analysis method. 


⁹ The skin friction coefficient in this analysis is a function of both 
R_θ and D as shown by Ross [9]. 


Unsteady Transfer of Momentum and 
Heat Between Concentric Cylinders 


Problems of the molecular transfer of momentum and heat in a fluid between two 
concentric cylinders are often encountered in the field of lubrication. It is shown herein 
that these problems can be solved by a generalized Fourier analysis, and that, in par- 
ticular, varying boundary conditions and nonhomogeneous terms in the diffusion equa- 
tion which are time-dependent can be treated by the Duhamel method from the theory 
of the conduction of heat in solids. Heat generated by viscous shear, as well as that 


Velocity Distribution 


Two typical cases of laminar flow have been in- 
vestigated. In the one, the motion of the fluid is due to an un- 
steady rotation of one cylinder.¹ In the other, the motion is due 
to an unsteady pressure gradient in the longitudinal direction. 
The velocity components obtained in these two cases can be 
superposed if the motion is caused by both a rotation of one cylin- 
der and a longitudinal pressure gradient. In the treatment of 
these two cases, the velocity distributions are given by Fourier 
expansions in terms of Bessel and associated functions. 

Motion Due to Rotation of One Cylinder. In the first case men- 
tioned, the fluid, initially at rest, is set in laminar motion by the 
inner cylinder (with radius a) which suddenly starts to rotate 
with an angular velocity ω at the time t = 0. If cylindrical co- 
ordinates r, θ, and z are used, and if the corresponding velocity 
components are denoted by u, v, and w, the Navier-Stokes equa- 
tions of motion are 

$$\frac{v^2}{r} = \frac{1}{\rho} \frac{\partial p}{\partial r} \qquad (1)$$

$$\frac{\partial v}{\partial t} = \nu \left( \frac{\partial^2 v}{\partial r^2} + \frac{1}{r} \frac{\partial v}{\partial r} - \frac{v}{r^2} \right) \qquad (2)$$


¹ The more general case in which both cylinders rotate can be 
treated similarly. 

Contributed by the Fluid Mechanics Subcommittee of the Hy- 
draulic Division and presented at the Hydraulic Conference, Ann 
Arbor, Mich., April 13-15, 1959, of THE AMERICAN SOCIETY OF ME- 
CHANICAL ENGINEERS. 


externally applied, is included in the analysis. 


in which p is the pressure, and ρ and ν are, respectively, the 
density and the kinematic viscosity of the fluid. The velocity 
components u and w are obviously zero. In order to deal only 
with dimensionless quantities, one makes the following substitu- 
tions 


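according to which (1) and (2) assume the new forms 

$$\frac{\partial p}{\partial \xi} = \frac{\phi^2}{\xi} \qquad (4)$$

$$\frac{\partial \phi}{\partial \tau} = \frac{\partial^2 \phi}{\partial \xi^2} + \frac{1}{\xi} \frac{\partial \phi}{\partial \xi} - \frac{\phi}{\xi^2} \qquad (5)$$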

If φ_s and φ_u are the steady and unsteady parts, respectively, of φ, 
one has 

$$\phi = \phi_s + \phi_u \qquad (6)$$

Then φ_s, which must satisfy (5) with the left side equal to zero 
and the boundary conditions at ξ = 1 and ξ = k = b/a (b = 
radius of outer cylinder), is given by 

$$\phi_s = \frac{1}{k^2 - 1} \left( \frac{k^2}{\xi} - \xi \right) \qquad (7)$$

The unsteady part φ_u has to satisfy (5) and the following bound- 
ary conditions: 

(i) φ_u → 0 as τ → ∞; (ii) φ_u = −φ_s at τ = 0; 
(iii) φ_u = 0 at ξ = 1 and ξ = k 

NOTE: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, January 
22, 1959. Paper No. 59—Hyd-17. 


Nomenclature 


a, b = respectively, radii of inner and outer cylinders 
k = b/a 
ρ = density 
μ, ν = dynamic and kinematic viscosities, respectively 
c_v = specific heat at constant volume 
κ, α = respectively, thermal conductivity and diffusivity 
σ = ν/α, Prandtl number 
t = time 
r, θ, z = cylindrical co-ordinates 
u, v, w = corresponding velocity components 
M = torque 
M̄ = dimensionless torque, see (13) and (14) 
−K = pressure gradient 
T = temperature 
T_0, T_1 = reference temperatures 
T, − T. = temperature rise due to viscous shear alone 
h = rate of heat transfer per unit longitudinal distance 
h̄ = dimensionless rate of heat transfer, see (31) and (32) 
J = Joule’s value 
G(ξ), H(ξ) = functions giving temperature rise due to viscous shear, 
see (42) and (43) 

Fig. 1 Establishment of rotational motion between concentric cylinders, 
owing to a sudden rotation of the inner cylinder 


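By the well-known method of separation of variables [1],² it can 
be readily shown that 

$$\phi_u = \sum_{n=1}^{\infty} A_n \left[ J_1(\lambda_n \xi) + \beta_n N_1(\lambda_n \xi) \right] e^{-\lambda_n^2 \tau} \qquad (8)$$

in which J_1 is the Bessel function and N_1 the Neumann function, 
each of the first order. The eigenvalues λ_n and the values β_n 
are determined by (iii), which is equivalent to 

$$J_1(\lambda) + \beta N_1(\lambda) = 0, \qquad J_1(k\lambda) + \beta N_1(k\lambda) = 0 \qquad (9)$$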


Equations (9) possess an infinite number of pairs of roots. To the 
eigenvalues λ_n there correspond the eigenfunctions 


Because of the orthogonality [2] of the eigenfunctions and the 
properties of J_1 and N_1 [3], the coefficients A_n in (8) are deter- 
mined from (ii) by 


in which Z_0 = J_0 + β_n N_0, J_0 and N_0 being, respectively, the Bessel 
function and the Neumann function of order zero. The values 
λ_n, β_n, and A_n up to n = 6 are tabulated in Table 1 for the case 
k = 2. The eigenvalues λ_n are taken from [3] (page 204) where 
they are given also for k = 1.2, 1.5, and . In Fig. 1, φ given by 
(6), (7), and (8) is plotted against ξ for different values of τ. 
After φ has been determined, a single integration of (4) gives 
p as a function of ξ and τ. From the definition of p given in (3) 
it can be seen that the pressure difference between any two points 
in the fluid at any time is proportional to ω². For large values of 
ω, the difference between the pressure acting at the outer cylinder 
and that at the inner cylinder can be very large. For the steady 


² Numbers in brackets designate References at end of paper. 
2Z_0(λ_n) 
— 


(10) 


4 


12.5812 
—0.9147 
—0.5202 

1.9927 
5.6412 


state and k = 2, the difference between the values of p at ξ = 1 
and ξ = 2 is given by 

$$p_2 - p_1 = \int_1^2 \frac{\phi_s^2}{\xi}\, d\xi = 0.2172$$

so that 

$$P_2 - P_1 = 0.2172\,\rho \omega^2 a^2$$
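
The figure 0.2172 can be checked by integrating the steady velocity profile numerically. The sketch below rests on assumptions that the damaged source cannot confirm directly: that the steady profile is the Couette solution φ_s = (k²/ξ − ξ)/(k² − 1), with φ = 1 on the inner cylinder and φ = 0 on the outer one, and that the dimensionless radial balance is ∂p/∂ξ = φ²/ξ with pressure scaled by ρω²a². The function names and the Simpson's-rule quadrature are illustrative choices.

```python
def phi_steady(xi: float, k: float) -> float:
    """Assumed steady dimensionless velocity between the cylinders."""
    return (k * k / xi - xi) / (k * k - 1.0)


def steady_pressure_difference(k: float, intervals: int = 2000) -> float:
    """Integrate phi_s^2 / xi from xi = 1 to xi = k with Simpson's rule."""
    if intervals % 2:
        raise ValueError("Simpson's rule needs an even number of intervals")
    step = (k - 1.0) / intervals

    def integrand(xi: float) -> float:
        return phi_steady(xi, k) ** 2 / xi

    total = integrand(1.0) + integrand(k)
    for i in range(1, intervals):
        weight = 4.0 if i % 2 else 2.0
        total += weight * integrand(1.0 + i * step)
    return total * step / 3.0


if __name__ == "__main__":
    print(f"p_2 - p_1 for k = 2: {steady_pressure_difference(2.0):.4f}")
```

For k = 2 the integral evaluates to about 0.2172, which agrees with the value in the text and supports the assumed scaling.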

The shearing stress at any point in the fluid is 


(¢) 
aa r 


In virtue of (9) and the identity [2] (page 145) 


(13) 


_ 


(14) 


n=1 


in which $M_1$ and $s_1$ are, respectively, the total moment per unit
length and the shearing stress acting on the inner cylinder, and $M_2$


Fig. 2 Curves I and II: Moments on the inner and outer cylinder, respectively, for a sudden rotation of inner cylinder.
Curves III and IV: Those for a gradual rotation of inner cylinder.
Table 1

n                          1         2         3         4         5         6
λ_n                     3.1966    6.3124    9.4445   12.5812   15.7199   18.8595
β_n                    −0.7100   −0.8380   −0.8884   −0.9147   −0.930    −0.96
A_n                     1.1079   −0.7578    0.5811   −0.5202    0.460    −0.42
Coefficient in (13)     1.9064    1.9729    1.9017    1.9927    1.904     2.00
Coefficient in (14)    −5.4580    5.5992   −5.4480    5.6412   −5.653     5.64
and $s_2$ are those acting on the outer cylinder. For k = 2, the
coefficients of the exponential functions in (13) and (14) are given in
Table 1. As n increases, these coefficients will approach 2 for
(13) and $(-1)^n\,4\sqrt{2} = (-1)^n\,5.657$ for (14), respectively. No great
errors will be introduced if for n > 6 these values are used,
particularly if $\tau$ is not extremely small. Equations (13) and (14) are
represented graphically by Curves I and II, respectively, in Fig. 2.
It can be seen from Fig. 2 that initially $M_1$ is infinite and $M_2$ is
zero, and that the former will decrease and the latter will increase
with time, until asymptotically they approach the same limit
corresponding to the steady state.
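
The eigenvalues listed in Table 1 can be cross-checked without Bessel-function tables. Eliminating $\beta$ from (9) gives the cross-product equation $J_1(\lambda) N_1(k\lambda) - J_1(k\lambda) N_1(\lambda) = 0$, whose roots have a standard large-argument (McMahon) expansion found in mathematical handbooks. The sketch below uses the first three terms of that expansion; the expansion is brought in here as an outside aid and is not the method of [3], and the function name is illustrative.

```python
import math


def mcmahon_root(n: int, k: float, order: int) -> float:
    """Three-term McMahon approximation to the n-th root of
    J_v(x) N_v(k x) - J_v(k x) N_v(x) = 0, with v = order and k > 1."""
    mu = 4.0 * order * order
    beta = n * math.pi / (k - 1.0)
    p = (mu - 1.0) / (8.0 * k)
    q = (mu - 1.0) * (mu - 25.0) * (k**3 - 1.0) / (6.0 * (4.0 * k) ** 3 * (k - 1.0))
    return beta + p / beta + (q - p * p) / beta**3


# Eigenvalues for k = 2 as tabulated in Table 1
table_1 = [3.1966, 6.3124, 9.4445, 12.5812, 15.7199, 18.8595]
for n, tabulated in enumerate(table_1, start=1):
    approx = mcmahon_root(n, 2.0, order=1)
    print(f"n = {n}: tabulated {tabulated:.4f}, approximate {approx:.4f}")
```

The approximation is weakest for n = 1, where it differs from the tabulated value by about 0.001, and improves quickly with n. Calling `mcmahon_root(n, 2.0, order=0)` gives the corresponding check for the zero-order problem of (26).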

One is now in a position to calculate the velocity distribution for
the case in which the inner cylinder rotates with a variable speed,
say $\omega f(\tau)$, $\omega$ being a constant, and $f(\tau)$ an arbitrary function of $\tau$.
The solution is

$$\frac{v}{\omega a} = f(0)\left[\phi_s + \phi_t(\tau)\right] + \int_0^{\tau} \left[\phi_s + \phi_t(\tau - \tau_1)\right] \frac{df(\tau_1)}{d\tau_1}\, d\tau_1 \qquad (15)$$

where $\phi_s$ and $\phi_t$ are given, respectively, by (7) and (8). That
(15) satisfies (2) and the boundary condition at the surface of the
outer cylinder is a consequence of the fact that $\phi_s + \phi_t$ satisfies
(5) and the same boundary condition. It remains only to show
that (15) satisfies the boundary condition at $\xi$ = 1. But at
$\xi$ = 1, $\phi_s + \phi_t$ = 1, consequently

$$\frac{v}{\omega a} = f(0) + \int_0^{\tau} \frac{df(\tau_1)}{d\tau_1}\, d\tau_1 = f(\tau)$$

as required. It is interesting to note the heuristic derivation of
(15). The area under the curve of the function $f(\tau)$ can be
considered to be built of semi-infinite strips of various constant
widths. To the first strip with a width equal to f(0) are added, at
time $\tau_1$, strips of widths $df(\tau_1)$. To the strip with width f(0) there
corresponds the solution $f(0)(\phi_s + \phi_t)$, and to the one with
width $df(\tau_1)$ beginning at $\tau = \tau_1$ there corresponds the solution
$[\phi_s + \phi_t(\tau - \tau_1)]\,df(\tau_1)$. Superposition of the effects of all the
strips gives the solution (15). This method of superposition is a
contribution of Duhamel to the theory of heat conduction in
solids [4] (page 19), and has often been applied to problems of
unsteady fluid flow [5].
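
The superposition can be illustrated on a single-mode stand-in. The sketch below assumes a unit-step response $g(\tau) = 1 - e^{-\lambda^2 \tau}$ (one term only, in place of the full series $\phi_s + \phi_t$) and a gradual input $f(\tau) = 1 - e^{-c\tau}$, and compares the Duhamel integral, evaluated numerically, with its closed form. The response function, the parameter values, and all names are assumptions made for illustration.

```python
import math


def step_response(tau: float, lam2: float) -> float:
    """Assumed single-mode response to a unit step applied at tau = 0."""
    return 1.0 - math.exp(-lam2 * tau)


def f(tau: float, c: float) -> float:
    """Gradual input history approaching 1."""
    return 1.0 - math.exp(-c * tau)


def f_prime(tau: float, c: float) -> float:
    return c * math.exp(-c * tau)


def duhamel(tau: float, lam2: float, c: float, n: int = 2000) -> float:
    """f(0) g(tau) + integral over tau_1 of g(tau - tau_1) f'(tau_1), by Simpson's rule."""
    if n % 2:
        n += 1
    h = tau / n
    total = 0.0
    for i in range(n + 1):
        t1 = i * h
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * step_response(tau - t1, lam2) * f_prime(t1, c)
    return f(0.0, c) * step_response(tau, lam2) + total * h / 3.0


def closed_form(tau: float, lam2: float, c: float) -> float:
    """Exact value of the same superposition integral (requires lam2 != c)."""
    return 1.0 - (lam2 * math.exp(-c * tau) - c * math.exp(-lam2 * tau)) / (lam2 - c)


lam2, c, tau = 3.1966**2, 20.0, 0.1
print(duhamel(tau, lam2, c), closed_form(tau, lam2, c))
```

For the full problem the same integral would be applied term by term to the series in (8).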

From the velocity distribution given by (15), the torques on the
two cylinders can be calculated easily. If the inner cylinder
starts to rotate gradually, and finally its angular speed approaches
a definite value $\omega$, a reasonable form for $f(\tau)$ is

$$f(\tau) = 1 - e^{-c\tau}$$

Substitution into (15), (13), and (14) yields


(16) 
2k? 
k*cA, A, Zo(kAn) 


(17) 


n=1 


Table 2 


For k = 2, c = 20, equations (16) and (17) are represented by
Curves III and IV, respectively, in Fig. 2. Although the variations
of $M_1$ and $M_2$ depend on k and on the variation of angular
speed of the inner cylinder, for any final angular speed $\omega$ of the
inner cylinder the principle of conservation of angular momentum
demands the following integral relation:

$$\int_0^{\infty} (M_1 - M_2)\,dt = 2\pi\rho\omega a \int_a^b \phi_s\, r^2\, dr \qquad (18)$$

where $\phi_s$ is given by (7). In virtue of the definitions of $M_1$ and $M_2$
given by (13) and (14) and the definition of $\tau$, this relation can be
written for any arbitrary values of $\omega$ and k, and any arbitrary
function $f(\tau)$, in the following way:


@ k2 1 
(M, M:2)dr = = (19) 
0 4 
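
The right-hand side of (18) reduces to a simple expression in k. As a sketch, assuming the classical steady profile $\phi_s = (k^2/\xi - \xi)/(k^2 - 1)$ and writing $r = a\xi$, the integral of $\phi_s \xi^2$ from 1 to k equals $(k^2 - 1)/4$, which the code below confirms numerically. The function names are illustrative only.

```python
def phi_steady(xi: float, k: float) -> float:
    """Assumed steady profile v/(omega*a) between the cylinders."""
    return (k * k / xi - xi) / (k * k - 1.0)


def angular_momentum_integral(k: float, n: int = 200) -> float:
    """Simpson's rule for the integral of phi_s * xi**2 from xi = 1 to xi = k."""
    if n % 2:
        n += 1
    h = (k - 1.0) / n
    total = 0.0
    for i in range(n + 1):
        xi = 1.0 + i * h
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * phi_steady(xi, k) * xi * xi
    return total * h / 3.0


for k in (1.2, 1.5, 2.0):
    print(k, angular_momentum_integral(k), (k * k - 1.0) / 4.0)
```

In dimensional terms this gives a final angular momentum of $2\pi\rho\omega a^4 (k^2 - 1)/4$ per unit length.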


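Motion Due to a Longitudinal Pressure Gradient. The distribution
of velocity as a function of time and position is sought for the case
in which the fluid, initially at rest, is set in laminar motion by a
longitudinal pressure gradient −K applied at t = 0 and maintained
thereafter.

Since u and v and all derivatives with respect to $\theta$ and z are zero,
the Navier-Stokes equation of motion for w is

$$\frac{\partial w}{\partial t} = \frac{K}{\rho} + \nu\left(\frac{\partial^2 w}{\partial r^2} + \frac{1}{r}\frac{\partial w}{\partial r}\right) \qquad (20)$$

Retaining the meanings of $\tau$ and $\xi$ and defining

$$\psi = \frac{\mu w}{a^2 K} \qquad (21)$$

one has

$$\frac{\partial \psi}{\partial \tau} = \frac{\partial^2 \psi}{\partial \xi^2} + \frac{1}{\xi}\frac{\partial \psi}{\partial \xi} + 1 \qquad (22)$$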


By a procedure similar to that employed in the preceding sec- 
tion, one finds 


n=1 
in which $Z_0 = J_0 + \beta_n N_0$, and

$$\psi_s = \frac{1}{4}\left[1 - \xi^2 + \frac{k^2 - 1}{\ln k}\,\ln \xi\right] \qquad (24)$$

A, 25) 


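and $\lambda_n$ and $\beta_n$ are solutions of

$$J_0(\lambda) + \beta N_0(\lambda) = 0, \qquad J_0(k\lambda) + \beta N_0(k\lambda) = 0 \qquad (26)$$

the function $Z_1$ being defined as $J_1 + \beta_n N_1$. The values for $\lambda_n$,
$\beta_n$, and $A_n$ up to n = 6 are given in Table 2.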

Equation (23) is plotted for various values of $\tau$ in Fig. 3. At
first, only the fluid near the boundaries feels their influence, the


Table 2

n                          1         2         3         4         5         6
λ_n                     3.1230    6.2734    9.4182   12.5614   15.7040   18.8462
β_n                     0.8922    0.9430    0.9610    0.9703    0.9764    1.0000
A_n                     0.2644    0.00776   0.01600   0.00133   0.00439   0.00047
B_n                     1.5077    1.0316    0.8310    0.7189    0.6353    0.6569
−B_n λ_n Z_1(λ_n)      −2.8656    2.8378   −2.8246    2.8337   −2.7955
Coefficient in (32)     4.0352    4.0086    3.9962    4.0040    3.9688    4.0

Fig. 3 Establishment of longitudinal flow, owing to sudden application
of a constant pressure gradient


bulk of the fluid moving with an almost uniform velocity. Then, 
the effect of the boundaries spreads gradually throughout the 
whole fluid. 

The shear stress at the boundaries can be computed in a 
straightforward manner from (23). Since the speed is increasing 
monotonically, these stresses approach their steady-state values 
asymptotically from below. 

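If the pressure gradient is $Kf(\tau)$, then application of Duhamel's
method gives

$$\frac{\mu w}{a^2 K} = \int_0^{\tau} \psi(\tau - \tau_1)\,\frac{df(\tau_1)}{d\tau_1}\,d\tau_1 + f(0)\,\psi$$

in which $\psi(\tau - \tau_1)$ is given by (23) on changing $\tau$ to $\tau - \tau_1$, and
$\psi$ is given by (23).
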


Temperature Distribution 


If the temperature T is a function of r and t only, the equation
governing the distribution of T (the energy equation) is, for an
incompressible fluid,

1 oT 
r 


in which $\alpha$ is the thermal diffusivity based on $c_v$, the specific heat
at constant volume, J is Joule's value, and the last bracket stands
for the dissipation function [6] (page 603).

The general method for solving (27) is the same as that used in
the foregoing for solving the equations of motion. For the case in
which the fluid, initially isothermal, is heated by a sudden
temperature rise at the outer cylinder, the distribution of T can be
expressed by a Fourier expansion in terms of Bessel and associated
functions. Problems with other boundary conditions can be
solved in a similar way. Varying boundary conditions can
be treated by the Duhamel method.

Temperature rise due to viscous shear can be calculated very 
easily for steady flow. This temperature rise is to be superposed 
on that due to external heating, calculated without consideration 
of the heat generated by viscous shear. For unsteady flow the 
dissipation function consists of infinitely many time-dependent 
terms. It will be shown that the resulting energy equation can be 
solved by applying the Duhamel method in a way slightly dif- 
ferent from the usual manner of application. 

Unsteady Thermal Boundary Conditions. The temperature
distribution is sought for the case in which the fluid, initially
isothermal and at temperature $T_0$, is heated at t = 0 by a sudden
temperature rise at the outer cylinder from $T_0$ to $T_1$, while
the temperature at the inner cylinder is kept at $T_0$. The heat
generated by viscous shear, for the time being, is excluded from
consideration.

By a procedure similar to that used in the section, "Motion Due
to Rotation of One Cylinder," (see [4] page 278), one finds


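$$\Omega = \frac{T - T_0}{T_1 - T_0} = \frac{\ln \xi}{\ln k} + \sum_{n=1}^{\infty} B_n \left[ J_0(\lambda_n \xi) + \beta_n N_0(\lambda_n \xi) \right] e^{-\lambda_n^2 \tau} \qquad (28)$$

in which $\lambda_n$ and $\beta_n$ are the nth pair of roots of (26). The
coefficients $B_n$ are determined from the initial condition to be
($Z_1 = J_1 + \beta_n N_1$)

$$B_n = -\frac{2k\,Z_1(k\lambda_n)}{\lambda_n \left[ k^2 Z_1^2(k\lambda_n) - Z_1^2(\lambda_n) \right]} \qquad (29)$$

For k = 2, $\lambda_n$, $\beta_n$, and $B_n$ are given in Table 2, and $\Omega$ is plotted
in Fig. 4.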

The rate of heat transfer h per unit longitudinal distance is
given by

$$h = 2\pi r \kappa\,\frac{\partial T}{\partial r} \qquad (30)$$

where $\kappa$ is the thermal conductivity. If $h_1$ and $h_2$ denote the
values of h at $\xi$ = 1 and $\xi$ = k, then


hy 1 
In k 


— BA (31) 
n=1 
he 


(32) 


n=l 


For k = 2, the coefficients of the exponential functions in (31) and
(32) are given in Table 2. For n > 5, those in (31) can be taken
to be $(-1)^n\,2\sqrt{2}$, and those in (32) to be 4. The resulting $h_1$ and
$h_2$ are represented by Curves II and I, respectively, in Fig. 5.
If the temperature at $\xi$ = k is varying with time and equal to

$$T_0 + (T_1 - T_0) f(\tau)$$

in which $f(\tau)$ is an arbitrary function of $\tau$, then application of the
Duhamel method gives

$$\frac{T - T_0}{T_1 - T_0} = \int_0^{\tau} \Omega(\tau - \tau_1)\,\frac{df(\tau_1)}{d\tau_1}\,d\tau_1 + f(0)\,\Omega \qquad (33)$$
Fig. 4 Unsteady temperature distribution owing to a sudden rise in
temperature at outer cylinder, thermal effects of viscous shear excluded

Fig. 5 Dimensionless rates of heat transfer. Curves I and II: Rates for
outer and inner cylinder, respectively, for a sudden temperature rise at
outer cylinder. Curves III and IV: Those for a gradual temperature rise
at outer cylinder.


in which $\Omega$ is given by (28). If $f(\tau) = 1 - e^{-c\tau}$ (where c is an index
of the rapidity with which the asymptotic temperature $T_1$ at
the outer cylinder is approached), it follows from (30) and (33)
that


For k = 2 and c = 20, (34) and (35) are represented by Curves IV
and III, respectively, in Fig. 5.


Per unit longitudinal distance, the total heat inflow must be
equal to the heat required to change the temperature of the fluid
from $T_0$ to its asymptotic value given by

$$T_0 + (T_1 - T_0)\,\frac{\ln \xi}{\ln k}$$

Consequently, one has

$$\int_0^{\infty} (h_2 - h_1)\,dt = 2\pi\rho c_v (T_1 - T_0)\,a^2 \int_1^k \frac{\ln \xi}{\ln k}\,\xi\,d\xi = 2\pi\rho c_v (T_1 - T_0)\,a^2 \left( \frac{k^2}{2} - \frac{k^2 - 1}{4 \ln k} \right) \qquad (36)$$

or, from the definitions of $h_1$ and $h_2$ given by (31) and (32), and
recalling $\kappa = \alpha\rho c_v$, $\tau = \alpha t/a^2$, one has


o 
f (he hi = 2 ad 


which is true for an arbitrary asymptotic temperature $T_1$, any
arbitrary value of k, and any arbitrary manner in which $T_1$ is
approached.

Temperature Rise Due to Steady Viscous Shear. Temperature rise
due to viscous shear for a steady flow about as well as along the
axis of the concentric cylinders can be obtained by direct integration
of the energy equation. If the temperature at both cylinders
is kept at $T_0$, one can set the left side of (27) equal to zero, and
deal with the resulting ordinary differential equation. In virtue
of the steady-state values of v and w given, respectively, by (3),
(7), (21), and (24), this equation can be written


(37) 


de ae 16 (-2 + ink 
'n which 
T, — To 
Q= Te (39) 


K*oa* K 
To 
$T - T_0$ being the temperature rise due to viscous shear alone,
$\mu$ the dynamic viscosity, and $\sigma$ the Prandtl number defined to be
$\nu/\alpha$.


Integration of (38) gives 


K, = 


(40) 


Q(t) = KiG(é) + (41) 
in which 


1 2 


constants of integration having been determined to satisfy the 
boundary conditions. 

The solution just given is based on the rotation of only one
cylinder as far as the flow about the axis is concerned. But it can
be applied immediately to the case in which the cylinders rotate
with two different constant speeds if one replaces $\omega$ by the
difference between the two angular speeds. The function $G(\xi)$ is
plotted in Fig. 6. The function $H(\xi)$ has already been given by
Schlichting [7], but for the sake of completeness is reproduced
in Fig. 7. In both Figs. 6 and 7, the parameter $(\xi - 1)/(k - 1)$
is used as the abscissa.

The temperature rise calculated in this section can be superposed
on that calculated in the section, Unsteady Thermal
Boundary Conditions.

Fig. 6 Function for temperature rise owing to viscous shear for steady
longitudinal flow

General Method for Calculating Temperature Rise Due to Unsteady 
Viscous Shear. If the flow is unsteady the dissipation function 
represented by the bracket in (27) consists generally of infinitely 
many time-dependent terms, and an equation of the following 
type is encountered: 


mM 


s=1 


(45) 
By the methods developed in the two preceding sections, an equa- 
tion of the type 


or 


with the initial condition $f_j(\xi, 0) = 0$ can be solved, so that the
functions $f_j$ can be considered as known. Then the function


(46) 


j 
~ + 


and the initial condition $\Omega_j(\xi, 0) = 0$, as can be shown by a
straightforward calculation. This result illustrates still another
application of the Duhamel method, which differs from the
usual application in that it deals with a time-dependent term
in the nonhomogeneous part of the diffusion equation instead
of a varying boundary condition. With the $\Omega_j$ thus found, the
final solution is


(47) 
Fig. 7 Function for temperature rise owing to viscous shear for steady
rotational motion, after Schlichting [7]


Varying boundary conditions can again be treated by the Du- 
hamel method as in the foregoing. 


Conclusions 


1 Specific problems of laminar momentum or heat transfer in 
a fluid between two concentric cylinders have been solved by well- 
known methods. The solutions, expressible in terms of Bessel and 
associated functions, have been carried to the numerical stage. 

2 Varying boundary conditions and nonhomogeneous terms 
depending on time in the equation of heat diffusion have been 
treated successfully with the Duhamel method. 
DISCUSSION
John A. Clark³


This is an important application of applied mathematics to 
an interesting problem. After reading the paper the following 
questions come to mind on which I would appreciate the author’s 
comment. Does the author plan any extension of the work to 
include the influence of eccentricity? Would he have a com- 
ment on this in relation to the present results? Would he 
comment on the effect of thermal-dependent properties, especially 
viscosity, on the solutions presented? 


³ Professor of Mechanical Engineering, Department of Mechanical
Engineering, University of Michigan, Ann Arbor, Mich. Assoc.

Author's Closure


Professor Clark’s questions will be answered in the order in 
which they have been posed. The work reported here was done 
primarily in preparation for the study of the hydrodynamic sta- 
bility of unsteady flows, which has recently been found to be 
amenable to analysis, but is published here because of its general 
bearing on the problems of lubrication. Solutions for steady flows 
(relative to the one and only cylinder that is eccentric) are already 
available elsewhere [5]. Those for unsteady flows between con- 
centric cylinders are, as far as the author is aware, not known 
Although they will be much more difficult to obtain, it is possible 
to obtain them for small eccentricities by expanding the solutions 
in ascending powers of the two parameters characterizing the two 
eccentricities. No such work is planned, because, as stated, this 
work has been done to facilitate future investigations of the hy- 


216 / MARCH 1960 


drodynamic stability of unsteady flows. The present results will 
be useful for studying unsteady momentum and heat transfer 
between cylinders with small eccentricities in the sense that the 
solutions sought are in the neighborhood of the ones presented 
here or of others similar to them. As to the effects of thermal 
variation of fluid properties on the flow and on the temperature 
distribution, they can be determined by a process of successive 
approximation. Starting from the solutions presented here, one 
can correct the viscosity and diffusivity accordingly, and use the 
new (time- and space-dependent) values in the second approximation.
Care must be taken in the second approximation to put
the viscosity and the diffusivity in the correct places in the equations
of motion and in the diffusion equation, since they are no
longer constant. The resulting equations will be more complicated,
but still linear, and of a type similar to (45). Consequently,
they can be solved with some patience and effort. 
Resistance of Enclosed Rotating Disks


The fundamental fluid mechanics associated with the rotation of a smooth plane disk
enclosed within a right-cylindrical chamber have been studied both experimentally
and theoretically. In order to acquire further and systematic information pertinent
to this problem, which has received much attention in the past, torque data were obtained
over a range of disk Reynolds numbers from $10^3$ to $10^7$ for axial clearance-disk radius
ratios s/a from 0.0127 to 0.217 for a constant small radial tip clearance, and
velocity and pressure data were obtained for laminar and turbulent flows. The existence
of four basic flow regimes in the axial gap between the disk and casing wall was
verified, and these regimes, the existence and extent of which are governed by the Reynolds
number-axial spacing combinations, have been delineated. A new approximate
theoretical analysis has accounted for axial-clearance effects for the case of separate
boundary layers on the disk and end wall; this theory has been checked against test
results. Velocity and pressure data have shown that the concept of a fluid "core" rotation
in the case of separate boundary layers must be modified because of secondary flows and
skewed boundary layers.


Contributed by the Fluid Mechanics Subcommittee of the Hydraulic
Division, and presented at the Hydraulic Conference, Ann
Arbor, Mich., April 13-15, 1959, of THE AMERICAN SOCIETY OF
MECHANICAL ENGINEERS.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, December
17, 1958. Paper No. 59-Hyd-9.


Rotating disks may be classified in two broad categories:
(a) "Free disk," a disk which rotates in a fluid mass of infinite
extent and initially at rest; (b) "enclosed disk," a disk
which rotates within a chamber of finite dimensions; herein, the
chamber is considered to be fully closed so that only a finite
volume of fluid is affected by the disk motion.

This paper concerns the latter, which includes the kinds of conditions
obtained in centrifugal machinery where problems of disk-friction
torque and power loss and of heat transfer are related to
the circulation and secondary flows induced by the rotating element.
These induced flows are dependent on the geometries of
the rotating element and its enclosure. The investigation reported
here is an attempt to clarify further the fundamental
aspects of the fluid mechanics involved by re-examining the case
of a smooth plane disk within an axially symmetric, smooth-walled,
right-circular cylinder of variable length. There have


Journal of Basic Engineering 


a = radius of disk, ft -, subscript r, referring to rotor y = distance normal to stator, ft 
Ay;> — = numerical coefficients R Reynolds number, as defined; z = distance normal to rotor, ft 
b = thickness of disk, ft if not defined, R = disk 8 = angular velocity, radians per 
ce = disk radial-tip clearance, ft Reynolds number = wa*/v sec (rps) 
C = numerical coefficient 8 axial clearance between disk 6 = disk boundary-layer thick- 
C,, = torque coefficient, defined by and end wall, ft ness, ft 
rr -, = subscript s, referring to stator 7 = dimensionless distance from 
M = C,, , pwra’, on two u = absolute tangential-velocity solid boundary 
faces of disk component, fps 0 = end wall boundary-layer 
f = friction coefficient u, relative tangential-velocity thickness, ft 
gq = acceleration due to gravity, fpe = dynamic viscosity, lb-sec/ft* 
ft/sec? : absolute radial-velocity com- vy = kinematic viscosity, ft*/sec 
h = piezometric head, ft fps = cylindrical wall boundary- 
reference radial velocity near : 
K B/w, ratio of angular veloci- rotor, fps layer thickness, ft 
ties Ups velenenee radial velocity near p = mass density, slugs per ft? 
M = frictional torque (moment), stator, fps : T = boundary shear stress, psf 
ft-lb v, = relative radial-velocity com- T, = radial component of shear 
M’ = frictional torque on tip of ro- ponent, fps : stress, psf 
tating disk, ft-lb V resultant relative velocity, fps T, = tangential component of shear 
p = pressure intensity, psf wo reference axial velocity, fps stress, psf 
r = radius, ft - subscript w referring to cylin- @ ) = function of 
hydraulic radius, ft drica] wall = angular velocity, rps 
been several previous studies of this problem, leading to limited
correlation between experimental disk-friction values and theory
[1, 2].¹ When this program was initiated in 1955 there was no
theoretical prediction of an effect of chamber proportions on disk 
friction with turbulent flow, which had been observed for a 
limited test range [3], although this has been discussed recently 
for one possible mode of turbulent flow [4]. Neither has there 
been obtained definite information about the circulation of 
secondary flows and the generation of skewed boundary layers, 
nor a clear delineation of the several modes of flow which may 
exist. 

To investigate these questions further a test rig was built which 
would allow the usual disk-friction-torque measurements at dif- 
ferent speeds, and also pressure, temperature, and velocity tra- 
verses in the liquid-filled spaces surrounding the disk. In addi- 
tion, an analysis was made based on a model of the flow which 
recognized all different flow regimes which can exist, and the 
analysis was tested against the measured data. 

Experimental data were obtained for the following ranges of 
variables: Axial clearances between the disk (rotor) and the sta- 
tionary circular end wall of the casing (stator) were varied in a 
number of finite steps over a range of s/a from 0.0127 to 0.217. 
The small radial tip clearance was constant at c/a = 0.00637 for 
all runs. The range of disk Reynolds numbers for the torque
tests was $10^3$ to $10^7$. Velocity traverses were made across the
gap between rotor and stator for turbulent flow with a disk 
Reynolds number R of 4.4 X 10 for three s/a values. For 
laminar flow, the tangential velocity only was determined midway 
between rotor and stator for four s/a spacings at R = 4.6 × $10^4$.
Pressure distributions were measured on the stator for a number 
of s/a values for both laminar and turbulent-flow conditions. 


Theoretical Considerations 


Classification of Flow Regimes. For the geometry investigated 
the mode of flow in the fluid between the disk and stator depends 
only upon the Reynolds number and the s/a ratio. Four dif- 
ferent modes of flow may be possible for a given geometry (s/a) 
over a range of R. These are: 

Regime I: Laminar Flow, Close Clearance. Boundary layers on the
rotor and stator are merged so that a continuous variation in
velocity exists across the axial gap s.

Regime II: Laminar Flow, Separate Boundary Layers. The combined
thickness of the boundary layers on the rotor and stator is less
than the axial gap s; between the boundary layers is a core
region in which no change in velocity is expected to occur.

Regime III: Turbulent Flow, Close Clearance. The turbulent
counterpart of Regime I, for higher Reynolds numbers and turbulent
flow on the circular surfaces.

Regime IV: Turbulent Flow, Separate Boundary Layers.

In the past, various investigators have pointed out the existence of
one or more of these modes of flow; no single writer, however, has
explicitly emphasized the presence of all four and their limits of
occurrence.

For a given s/a, within practical ranges of R, it may or may not 
be possible to experience all four regimes; the s/a ratio is sig- 
nificant in determining the type of flow. The analytical ap- 
proach to be outlined attempts to account for the effects of the 
axial-clearance ratio through other than a purely empirical cor- 
relation of torque data, and for the separate boundary-layer re- 
gimes considers the effect of the frictional resistance on the 
cylindrical walls of the casing. 

Brief Review of Previous Solutions: Regime I. The relationships for
this case have been fully discussed in [1, 2, 4]. Couette, laminar
flow is assumed, with no radial-velocity component present, and
the effect of the friction on the cylindrical walls is ignored. The
resulting tangential velocity varies linearly across the gap, from
zero on the stator to $\omega r$ on the rotating disk, the velocity gradient
being $du/dz = \omega r/s$.

Substitution of this value into the equation for the frictional
torque on the two surfaces of the rotor gives

$$M = 2\int_0^a 2\pi r^2\,\tau_u\,dr \qquad (1)$$

and letting $\tau_u = \mu(du/dz)$, leads to the expression for the
dimensionless torque coefficient

$$C_m = \frac{2\pi}{(s/a)R} \qquad (2)$$

Equation (2) has been verified in the past by limited experimental
data. In both [1] and [4], refinements are added to this equation
to account for radial and axial velocities actually present.


¹ Numbers in brackets designate References at end of paper.
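
The step from the linear velocity profile to the Regime I torque coefficient can be checked numerically. The sketch below integrates the shear stress $\mu\omega r/s$ over both faces of the disk, forms the coefficient from the definition $M = C_m \tfrac{1}{2}\rho\omega^2 a^5$ given in the nomenclature, and compares it with $2\pi/[(s/a)R]$. The fluid properties and dimensions are hypothetical values chosen only for illustration, and the function names are not from the paper.

```python
import math


def torque_regime_one(mu: float, omega: float, a: float, s: float, n: int = 200) -> float:
    """Torque on both faces of the disk for a linear (Couette) profile, by Simpson's rule."""
    if n % 2:
        n += 1
    h = a / n
    total = 0.0
    for i in range(n + 1):
        r = i * h
        tau_u = mu * omega * r / s  # shear stress for du/dz = omega * r / s
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * 2.0 * math.pi * r * r * tau_u
    return 2.0 * total * h / 3.0  # factor 2 for the two faces


def cm_regime_one(s_over_a: float, reynolds: float) -> float:
    """Closed-form torque coefficient for merged laminar boundary layers."""
    return 2.0 * math.pi / (s_over_a * reynolds)


# Hypothetical values in consistent units (slug, ft, sec)
rho, mu = 1.94, 2.1e-5
omega, a, s = 10.0, 0.8, 0.02

reynolds = omega * a * a * rho / mu
cm_numeric = torque_regime_one(mu, omega, a, s) / (0.5 * rho * omega**2 * a**5)
print(cm_numeric, cm_regime_one(s / a, reynolds))
```

The two numbers agree because the integrand is a cubic in r, which Simpson's rule integrates exactly.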

Regimes II and IV. Solutions for the separate boundary-layer
case generally follow the form of the momentum analysis as first
used by von Karman [5] in his analysis of the turbulent boundary
layer on a rotating free disk. Schultz-Grunow [1] wrote separate
sets of momentum equations, one for the rotor and one for the
stator, in which a fluid core rotating at angular velocity $\beta$ was
assumed and the effects of the cylindrical wall were ignored. For
the laminar Regime II the following expressions were obtained:

$$C_m = 2.66\,R^{-1/2} \qquad (3)$$

(4)

For the turbulent Regime IV, the comparable expressions are

$$C_m = 0.0622\,R^{-1/5} \qquad (5)$$

$$\frac{\beta}{\omega} = \frac{1}{1.954} = 0.512 \qquad (6)$$

The foregoing were summarized by Ippen [2], who also carried
out an approximate solution for both regimes. The relative angular
velocity of the disk with respect to the assumed core, $\omega - \beta$,
was simply substituted for the free-disk angular velocity in Cochran's
final equations of the Navier-Stokes solution for laminar
flow on a free disk [6] and in von Karman's for the turbulent case,
assuming $\beta/\omega$ = 0.50 in each case, and neglecting effects of radial
pressure gradients. For Regime II, this gave


C,, = 2.600 (7) 
For Regime IV, the result was* 
Cn = 0.0836 (8) 


None of the foregoing expressions includes the effect of axial 
clearance. 
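
For orientation, the two Schultz-Grunow expressions (3) and (5) are easy to tabulate. The sketch below evaluates them over a few Reynolds numbers; the function names and the sample Reynolds numbers are illustrative, and no statement is intended about where either expression ceases to apply.

```python
def cm_laminar_separate(reynolds: float) -> float:
    """Equation (3): Regime II, laminar flow with separate boundary layers."""
    return 2.66 * reynolds ** -0.5


def cm_turbulent_separate(reynolds: float) -> float:
    """Equation (5): Regime IV, turbulent flow with separate boundary layers."""
    return 0.0622 * reynolds ** -0.2


# Core rotation ratio for the turbulent case, equation (6)
K_TURBULENT = 1.0 / 1.954

for reynolds in (1.0e4, 1.0e5, 1.0e6):
    print(f"R = {reynolds:.0e}: "
          f"laminar C_m = {cm_laminar_separate(reynolds):.5f}, "
          f"turbulent C_m = {cm_turbulent_separate(reynolds):.5f}")
```

Neither function takes s/a as an argument, which is the limitation the text points out.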

Regime III. Recently Soo [4] presented an analysis for this
region with the axial clearance included as a parameter, obtaining


C,, = 0.0622 (+) R-“ (9) 


a 


This expression alone, however, carries no indications of the limits
of its applicability.

Basis of New Analysis for Separate Boundary-Layer Regimes.
In the cases of Regimes II and IV, a proper theoretical treatment
would indicate the effect of axial clearance and cylindrical wall
friction. The following gives a new solution for the foregoing
regimes in which these variables are introduced into a momentum-type
analysis.

The following assumptions are made for the condition of separate
boundary layers, whether laminar or turbulent, on the disk
and stationary end wall. The fluid outside the boundary layers
rotates as a solid body with angular velocity $\beta = K\omega$, 0 < K < 1.0.
All radial flow occurs within the boundary layers; fluid flows
radially outward on the disk in the boundary layer of thickness $\delta$,
axially away from the disk on the cylindrical wall in the layer of
thickness $\xi$, and radially inward on the stationary end wall in the
layer of thickness $\theta$, as indicated in Fig. 1. For steady-state
conditions the activating torque of the disk on the fluid must be
opposed by an equal and opposite torque on the plane and curved
walls of the casing. Continuity requires the radial flows on the
two plane circular surfaces to be equal at the same radii. The
secondary flow on the disk has an angular momentum in excess of
the fluid in the core, and a corresponding momentum defect exists
in the secondary flow along the stationary walls. A zero axial-pressure
gradient is assumed, so that the pressures in the core are
imposed on the entire body of fluid in the casing. Radial tip
clearance is considered to be negligible.

The unknowns of the situation may now be listed: $\delta$, $\theta$, $\xi$, the
boundary-layer thicknesses on three surfaces; $v_0$, $v_{0s}$, $w_0$, three
reference radial (or axial) velocity components, one for each surface;
K, the ratio $\beta/\omega$.

Velocity distribution is here called an unknown; the velocity
profiles on the three surfaces are assumed to be affine.

Fig. 1 Notation definition sketch

Equations available are two boundary-layer momentum equa- 
tions on each of the three surfaces, the continuity equation, and 
the moment equation stipulating that the torque of the disk on the 
fluid is equal and opposite to the sum of the torques on the sta- 
tionary surfaces. If velocity distributions are assumed, the num- 
ber of equations exceeds the number of unknowns, a solution 
satisfying the redundant equation cannot be expected, and a 
rigorous solution becomes impossible. Numerical results in the 
solution to be given depend on the following four-step procedure: 


1 Boundary-layer and momentum conditions are found on 
the rotating disk in terms of K. 
2 A comparable solution is found for the end wall in terms of
K.

3 An approximate method is used for the cylindrical walls.
Instead of attempting to match the velocity distributions at the
tips of both rotor and stator, the value of $\xi$ is simply taken as
equal to $\delta$ at r = a, and the moment on the cylindrical wall is
calculated in terms of K.

4 A graphical solution yields the value of K at which, for a 
given s/a, the moment equation is satisfied. Since values of C,,, 
boundary-layer thicknesses, and velocity distribution are also ex- 
pressed in terms of K, these are thus determined for the particular 
geometry. 


The continuity equation is treated as redundant and is not 
satisfied. As the primary aim here is the prediction of torque 
with geometry, and since moment calculations are far less sensi- 
tive to assumed velocity distributions than are continuity values, 
this choice of redundants appears logical. In the following, the 
solution of the momentum equations will involve different 
boundary-layer and velocity assumptions than have been used 
previously. 

The radial-momentum equation for the rotor may be written: 


3 d 
pr dr de — — br dr 
r 


* 
d 
= — | or v'dzidr (10) 


No radial-momentum flux enters through the edge of the bound- 
ary layer, and there is no shear stress on this surface. The pres- 
sure gradient is written as 


$$\frac{dp}{dr} = \rho\,\frac{(K\omega r)^2}{r} = \rho K^2 \omega^2 r$$


Likewise for the rotating disk, the momentum equation in the 
tangential direction may be written: 


d 
redrt, = f r? ww dr 
0 


The torque of the rotating disk on the fluid may then be evalu- 
ated: 


(11) 


The resulting moment coefficient for the two faces of the disk is 
given by 


$$M_r = C_{m_r}\,\tfrac{1}{2}\,\rho\omega^2 a^5 \qquad (13)$$


The comparable four equations written for the stationary end 
wall are, respectively: 


d 
orar f dy +r drt, — Or — dr 
dr 


d 0 
f | dr (14) 
dr 0 


0 
rdrt, = — f ay] dr (15) 
of 0 
M, = 2 dr (16) 
0 
M, = C,,, — pw'*a (17) 
The moment equations for the cylindrical wall are, for two
walls of length s

$$M_w = 2 \times 2\pi a^2 s\,\tau_{u_w} \qquad (18)$$

$$M_w = C_{m_w}\,\tfrac{1}{2}\,\rho\omega^2 a^5 \qquad (19)$$

The requirement for torque equilibrium is

$$M_r = M_s + M_w \qquad (20a)$$

This may also be written:

$$(C_m = C_{m_r}) = C_{m_s} + C_{m_w} \qquad (20b)$$

Equation (20b) is solved graphically for each s/a.

Regime II. For this case, a simple parabolic velocity distribution
was chosen to represent relative tangential velocities. For this
profile there is no velocity discontinuity, but there is a shear-stress
discontinuity, at the edge of the boundary layer. Absolute
tangential velocities are:

On the rotor:

$$ u = \omega r\,[\,1 - 2\eta(1 - K) + \eta^2(1 - K)\,] $$

On the stator:

$$ u = \omega K r\,(2\eta - \eta^2) $$

where η = z/δ on the rotor and η = y/θ on the stator.
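The boundary behavior of the assumed Regime II tangential profiles can be checked numerically. The following is a minimal sketch, assuming the parabolic forms u = ωr[1 − 2η(1 − K) + η²(1 − K)] on the rotor and u = ωKr(2η − η²) on the stator, with η the wall distance divided by the local boundary-layer thickness. The function names, the unit values of ω and r, and the example K = 0.44 are illustrative choices, not part of the original analysis.

```python
def u_rotor(eta, K, omega=1.0, r=1.0):
    """Absolute tangential velocity in the rotor boundary layer (assumed parabolic form)."""
    return omega * r * (1.0 - 2.0 * eta * (1.0 - K) + eta**2 * (1.0 - K))


def u_stator(eta, K, omega=1.0, r=1.0):
    """Absolute tangential velocity in the stator boundary layer (assumed parabolic form)."""
    return omega * K * r * (2.0 * eta - eta**2)


def slope(profile, eta, K, h=1e-6):
    """Central-difference estimate of du/d(eta) for a profile function."""
    return (profile(eta + h, K) - profile(eta - h, K)) / (2.0 * h)


if __name__ == "__main__":
    K = 0.44  # illustrative core-velocity ratio
    for name, profile in (("rotor", u_rotor), ("stator", u_stator)):
        print(name,
              "wall:", profile(0.0, K),
              "edge:", profile(1.0, K),
              "edge slope:", slope(profile, 1.0, K))
```

Both profiles reach the core velocity Kωr at η = 1 with zero slope, while the wall values are ωr on the rotor and zero on the stator, which is the no-slip condition on each surface.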

The radial-velocity distribution is so selected to make the radial 
component vanish at the edge of the boundary layer. These pro- 
files are: 

On the rotor:

$$ v = v_{0r}\,(2\eta - \eta^2)(1 - \eta)^2 $$

On the stator:

$$ v = v_{0s}\,(2\eta - \eta^2)(1 - \eta)^2 $$


Boundary shear stress components are: 
On the rotor: 


du | ' K 2 
oe = ) 7 
dz | mo 6 


dv 


Tor = 
dz 


On the stator: 


du K ( 2 ) 
Tun = = 


dv 2 o* 
Tor = = 


dy y=0 v 


Substitution of the foregoing velocity and shear terms into the
momentum equations for the rotor and stator leads to solutions of
the form v_{0r} = A_1 ωr and δ = const on the rotor, and to
v_{0s} = A_2 ωr and θ = const on the stator, where A_1, A_2, δ, and θ
are solvable as functions of K. Substitution of these results into
equations (12) and (13), and (16) and (17), yields solutions for
C_{m_r} and C_{m_s}, which are, in turn, functions of K. Assuming that
the boundary-layer thickness on the cylindrical wall equals δ at
r = a, substitution into equations (18) and (19) gives C_{m_c} as a
function of K. The results of these calculations are shown in Fig. 2.

The data points plotted in Fig. 2 are values for observed velocity
ratios K = β/ω measured at the mid-point of the axial
gaps, and corresponding C_m-values for the various s/a ratios
tested.

Fig. 2 Graphical solution for C_m and K, approximate theory, Regime II

Velocity distributions were not measured for the laminar regime,
hence the validity of the assumed profiles could not be
checked; from the degree of agreement between predicted and
measured values, actual tangential-velocity gradients at the rotor
must be approximately those calculated. It is interesting to note
that for K = 0, corresponding to the free-disk case, the equivalent
free-disk equations become

$$ C_m = 3.76\,R^{-1/2} \tag{29} $$

$$ \delta = 3.34\left(\frac{\nu}{\omega}\right)^{1/2} \tag{30} $$


These compare with the numerical values of 3.87 and 4.4, re- 
spectively, as found by Cochran; the torque coefficient is thus 
only 3 per cent less than that obtained by a rigorous solution. 
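The quoted 3 per cent figure follows directly from the two torque constants. As a worked check, assuming both coefficients multiply the same power of R:

```latex
\frac{3.76}{3.87} \approx 0.972
\qquad\Longrightarrow\qquad
\frac{3.87 - 3.76}{3.87} \approx 0.028 \approx 3\ \text{per cent}
```

The boundary-layer thickness constants differ by a larger margin (3.34 against 4.4), so the close torque agreement does not imply equally close agreement in thickness.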

Solutions to equation (20) for finite values of s/a are represented
in Fig. 2 by the intersections of the curve of C_{m_r} with those
of C_{m_s} + C_{m_c}. In Fig. 3, C_m and K are plotted as functions of
s/a; data points for s/a spacings tested are also plotted. Further
refinements in the approximate theory were not considered justi- 
fied without a knowledge of the actual velocity distributions on 
the three surfaces. As measured radial-pressure gradients were 
smaller than assumed, another physical discrepancy is introduced 
into equations (10) and (14), and the net radial outflow thus im- 
plied in the core indicates that affine velocity distributions 
could not be assumed for both disks. 

Regime IV. A "one-seventh power law" tangential-velocity distribution
was assumed in preference to a logarithmic distribution
to facilitate numerical analysis. Within the range of Reynolds
numbers covered in the tests this power law should give satisfactory
results. The profile forms chosen are assumed effective over
the entire disk surface; the existence of a laminar boundary layer
in the central region of the disk, before the transition to a turbulent
layer, is precluded.

Fig. 3(b) K versus s/a, approximate theory, Regime II
The absolute tangential velocities are:
On the rotor:

$$ u = \omega r\,[\,1 - \eta^{1/7} + K\eta^{1/7}\,] \tag{31} $$

On the stator:

$$ u = K\omega r\,\eta^{1/7} \tag{32} $$
The same considerations as for Regime II are involved in assuming
the radial distributions, with the exponent of the (1 − η)
term chosen to give comparable dimensionless measured and assumed
profiles. The profiles used are:
On the rotor: 


v = vn) (1 — 


On the stator: 
v = voe(n)’ (1 — (34) 


Empirical relationships based upon the one-seventh-power
velocity law for flow in smooth pipes were used for the shear-stress
relationships:

$$ \tau_0 = 0.0225\,\rho V^2 \left(\frac{\nu}{V y}\right)^{1/4} \tag{35} $$

where V = the velocity relative to the boundary at a normal
distance y.
Letting

$$ V^2 = u_r^2 + v_r^2 $$

where the subscript r indicates a component relative to the surface
in question, the shear-stress components on the circular areas
may be written:

$$ \tau_r = 0.0225\,\rho V^{7/4}\left(\frac{\nu}{y}\right)^{1/4}\frac{v_r}{V} = 0.0225\,\rho\left(\frac{\nu}{y}\right)^{1/4} v_r\,(u_r^2 + v_r^2)^{3/8} \tag{36} $$

$$ \tau_\theta = 0.0225\,\rho\left(\frac{\nu}{y}\right)^{1/4} u_r\,(u_r^2 + v_r^2)^{3/8} \tag{37} $$


After substitution, the resulting solutions have the form rm = 


Awor and 
6 = Ag ( ) 
w 


on the rotor, and = and 
v= Adv/w)' 


on the stator, where A; — A, may be solved as functions of K. 
C_{m_r}, C_{m_s}, and C_{m_c} are determined in the same way as for Regime
II. The results of the calculation are plotted on Figs. 4 and 5.
K-values were determined by velocity profiles measured across the
gap at the respective s/a ratios investigated.

For the equivalent free-disk condition (K = 0), the results are 


$$ C_m = 0.146\,R^{-1/5} \tag{38} $$

$$ \delta = 0.555\,a\,R^{-1/5} \tag{39} $$


These compare with the numerical values of 0.146 and 0.526, re- 
spectively, found by von Karman; since the radial-velocity dis- 
tributions assumed in the two solutions are different, the pre- 
dominating influence of the tangential-velocity component is in- 
dicated. 


Delineation of Flow Regimes. Based on the foregoing analysis, for 
a known s/a and R, both the corresponding flow regime and fric- 
tional torque may be predicted. Theoretical torque equations to 
be used are: 


Regime I:    Equation (2)
Regime II:   C_m = C R^{-1/2},  C = φ(s/a)
Regime III:  Equation (9)
Regime IV:   C_m = C R^{-1/5},  C = φ(s/a)


For Regimes II and IV, the numerical values of the coefficients C 
are taken from the theoretical curves in Figs. 3 and 5. 

In summary, the foregoing equations are of the form C_m proportional
to R^{-1}, R^{-1/2}, R^{-1/4}, and R^{-1/5}, respectively. The
intersections of the straight-line segments on a logarithmic plot
representing these regimes indicate the extent or existence of the
various regimes for a particular s/a. These results are summarized
in Fig. 6, which covers the s/a range of 0.01 to 0.20, and
which extends to values of R which are beyond all but the most
unusual applications. The indicated C_m-values are undoubtedly
too low for R > 10^7, as the one-seventh-power velocity profile
assumed is no longer applicable at these higher Reynolds numbers.
From Fig. 6 it may be observed: 


1 Regime I exists for all spacings if R is made sufficiently small. 

2 For a small s/a, Regime II may never exist.

3 For a large s/a, Regime III may never exist.

4 Regime IV exists for all s/a only if R is sufficiently high; as 
s/a increases, the transition to Regime IV occurs at a lower R. 


It is noted that for s/a > 0.05 (approximately), Regimes II and 
IV cover all but the lowest R range. 
Fig. 6 Delineation of flow regimes by enclosed-disk theory

Experimental Equipment and Procedures


Test Stand and Accessories. The general layout of the test stand 
is indicated in Fig. 7; a more complete description is included in 
[7]. 

A 5-hp d-c motor with a 125-2000-rpm speed range served as 
the drive unit. Shaft speeds were measured by means of a revolu- 
tion counter and an electric timer. The motor was directly 
coupled to a 2-in-diam stainless-steel shaft to which interchangeable
disks could be attached. Fig. 8 is a detailed drawing of the
disk-shaft connection.

Two smooth, plane, bronze disks of 19*/, in. OD and '/, and '/, 
in. thickness were used. 

The axial length of the chamber was varied in finite increments 
by different combinations of smooth bronze sleeves of 19*/, in. ID 
used with appropriate spacers, mounted in a stationary cylindrical 
casing, Fig. 9, in which smooth circular cover plates (stators) 
also were mounted. Axial symmetry of casing geometry was 
obtained by attaching dummy shafts to the front cover plate. 

Test fluids were water and four solvent-refined, paraffin-base 
commercial lubricating oils; the numerical designations of these 
oils on the figures give their approximate viscosity, in SSU, at 
100 F. A recirculation system containing a storage reservoir and 
a heat exchanger provided cooling of fluid heated during the test 
period. No through-flow circulation was allowed during the 
test runs. Mercury thermometers mounted in recesses in the end
wall gave temperature data during the test runs; these thermometers
were calibrated against copper-constantan thermocouples
located in the fluid-filled chamber over the range of fluids,
geometries, and speeds used in the program.

Torque Measurements. Torques were measured by means of SR-4
bonded strain gages cemented to the inside wall of a recess in the
shaft; the four water-proofed gages were placed at angles of 45 deg
from the shaft center line, the resulting bridge circuit registering
torsional stresses only. Voltage differences from this circuit were
taken off through coin silver-graphite slip rings mounted on the
shaft. Torque-circuit excitation was provided by a 6-volt automobile
battery; a d-c galvanometer served as the indicating device.
An attenuation circuit provided full-scale meter readings
for various torque ranges. A static calibration was performed over
the full galvanometer-scale range for each attenuation; any effect
of strain-gage misalignment was accounted for by "free-disk"
runs in air.

Fig. 8 Detail drawing of disk-shaft assembly

The shaft recess containing the strain-gage bridge was located 
on the fluid side of the shaft seal; in this way the only torque 
measured was that due to fluid friction, no deductions being 
needed for bearing and/or seal friction. The frictional resistance 
of the disk tip was included in the gross torque measurement; as 
all results are expressed for friction on the disk surfaces only, a 
deduction for the calculated tip friction was necessary. 

When the flow in the thin annular gap between the cylindrical
wall and the disk tip was considered to be laminar, a Couette flow
was assumed in this space. The equation for the tip-friction
torque then became:

$$ M' = \tau_0\,(2\pi a b)\,a = \rho\nu\,\frac{\omega a}{c}\,(2\pi a b)\,a = C_m'\,\tfrac{1}{2}\,\rho\,\omega^2 a^5 \tag{40} $$

The tip-friction-torque coefficient was then

$$ C_m' = \frac{4\pi b}{a c}\,\frac{\nu}{\omega a} \tag{41} $$

which was readily calculated for each configuration-R combination.
The final value of C_m as used in equations and shown on included
plots was the difference between C_m (gross), based on the
total measured torque, and C_m'.
For turbulent flow the annulus between the disk tip and the
sleeve was treated as a two-dimensional duct with no side-wall
friction, which was converted into an analogous circular pipe.
The mean velocity in the flow was taken as one half the disk
peripheral velocity. The shear stress on the disk tip was expressed
as

$$ \tau_0 = \frac{f}{4}\,\frac{\rho V^2}{2} \tag{42} $$

The pipe Reynolds number was determined:

$$ R' = \frac{4 V R_h}{\nu} = \frac{\omega a c}{\nu} \tag{43} $$

This gave the following expression for C_m':

$$ C_m' = \frac{\pi f b}{8 a} \tag{44} $$

where f was determined from the smooth-pipe curve on a Moody
diagram.
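The tip-friction deduction can be sketched in a few lines. This is a minimal illustration, assuming the Couette relation C_m' = 4πνb/(ωa²c) for laminar flow and the pipe-analogy relation C_m' = πfb/(8a) for turbulent flow, with b taken as the tip width, c as the radial tip clearance, and consistent units throughout. The Blasius smooth-pipe fit f = 0.316 Re^(-1/4) is a stand-in chosen here for the Moody-diagram reading and is only reasonable for pipe Reynolds numbers of roughly 4 × 10^3 to 10^5. All function names are hypothetical.

```python
import math


def tip_cm_laminar(nu, omega, a, b, c):
    """Tip-friction coefficient for Couette flow in the tip gap (assumed form)."""
    return 4.0 * math.pi * nu * b / (omega * a**2 * c)


def blasius_friction_factor(re_pipe):
    """Darcy friction factor from the Blasius fit (assumed substitute for the Moody chart)."""
    return 0.316 * re_pipe ** -0.25


def tip_cm_turbulent(nu, omega, a, b, c):
    """Tip-friction coefficient from the equivalent-pipe analogy (assumed form)."""
    re_pipe = omega * a * c / nu  # 4 * V * R_h / nu with V = omega*a/2, R_h = c/2
    f = blasius_friction_factor(re_pipe)
    return math.pi * f * b / (8.0 * a)


def net_cm(cm_gross, cm_tip):
    """Disk-face torque coefficient after deducting the tip contribution."""
    return cm_gross - cm_tip


if __name__ == "__main__":
    # Illustrative SI values only; not the test-stand dimensions.
    nu, omega, a, b, c = 1.0e-6, 100.0, 0.25, 0.0127, 0.001
    print("laminar tip C_m':", tip_cm_laminar(nu, omega, a, b, c))
    print("turbulent tip C_m':", tip_cm_turbulent(nu, omega, a, b, c))
```

Keeping the friction-factor correlation in its own function makes it easy to replace the Blasius fit with values read from a chart.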

The following comparisons indicate the validity of this tip- 
correction procedure. For s/a = 0.0255, final C,, values using the 
'/, and '/:-in-thick disks were identical for equal Reynolds num- 
bers, for both laminar and turbulent flows. In Regime I the ex- 
perimental values of C,, agreed well with the simple laminar 
theory. A limited number of tests were also made with a smooth 
disk, also of 1%/s-in. diam, which tapered uniformly from '/, in. 
thickness at a 1-in. radius to '/,. in. thickness at the tip. Except 
for small s/a, where even this slight taper was important, C,,- 
values for this tapered disk agreed with those obtained with the 
smooth disks within the experimental accuracy. 

Velocity Measurements. Velocity profiles were obtained at three
radii by a two-step procedure. First, a 2-hole cobra probe was
used to determine local velocity directions; then, with Reynolds
numbers, probe location, and orientation duplicated, velocity mag- 
nitudes were determined by means of a modified pitot-static tube. 
This sequence was adopted in order to obtain accurate directional 
measurements, be independent of any possible axial pressure 
gradients, and allow the probes to be as small as possible. As 
axial-flow components were made small with respect to tangential 
and radial velocities, the probes measured components only in the 
plane parallel to the disk surface. 

Details of the 0.035-in-OD stainless-steel probes are shown in
Fig. 10. Flexible leads connected the probe openings with a
diaphragm-type pressure gage utilizing a Schaevitz linear variable
differential transformer as the transducer. Excitation for the
transformer was provided by a 10-kc oscillator; the transformer
output was rectified and fed to the same d-c galvanometer used
for torque indications. This circuit was calibrated hydrostatically.
A velocity-probe-traverse-control unit, mounted on
the front cover plate, allowed the tip of the probe to be moved
axially and to be rotated about a point. Probe directions were
read to the nearest 5 min; displacements of the probe perpendicular
to the plane of the disk were registered to the nearest 0.001 in.

Pressure Measurements. Radial-pressure distributions on the 
front cover plate were obtained from measurements at four '/s-in. 
static pressure holes at radii of 3, 5, 7, and 9 in. Three such holes
also were drilled in the 2'/;-in-long sleeve. All pressure taps were 
connected to a manifold arrangement in turn coupled to a mer- 
cury-water or mercury-oil U-tube. A free-surface, constant-head 
tank connected to the casing through the annulus between the 
shaft and the front cover plate provided a constant reference 
pressure at this 1'/\-in. radius. Static pressures within the flow 
field could be measured at three radii by means of the pitot-static 
tube; these agreed well with wall measurements when extrapo- 
lated to the end wall. Pressures on both the end wall and 
cylindrical sleeve surface were expressed in terms of the piezo- 
metric head differential between these points and that at the 
center of rotation. 
Fig. 11 Smooth-disk torque data: s/a = 0.0255

Fig. 12 Smooth-disk torque data: s/a = 0.115

Experimental Results


Torque Data. Figs. 11 and 12 are representative of the results ob- 
tained for individual spacing ratios tested; Fig. 13 is a summary 
plot of the smooth-disk experimental data. In each plot the data 
points have been connected by smooth curves; in Figs. 11 and 12, 
as many of the four flow regimes as may apply for that particular 
s/a have been indicated by the appropriate straight lines repre- 
senting the theoretical expressions outlined previously. The 
way in which the data follow the slopes of these theoretical curves 
is an external indication of the flow regimes actually obtained; 
the smooth variation from one regime to another indicates the 
gradual movement of the point of transition from one mode of 
flow to another over the disk surface with changes in Reynolds 
number. 

The s/a = 0.0255 data indicate that the merged boundary-layer
case was present for both laminar and turbulent flow. It is
uncertain if Regime IV was reached at the higher Reynolds
numbers for this geometry; velocity traverses indicated merged
boundary layers persisting at R = 4.4 × 10^6.

Data for s/a = 0.115 (s = 1 1/8 in.) indicate that within the test
range Regimes II and IV only were present. It is noted that at 
the highest Reynolds numbers the slope of the torque-coefficient 
curve tends to decrease, a result comparable to that for mean 
skin-friction coefficients on smooth flat plates. 

For this geometry, the intersection of the two theoretical curves
at R = 1.5 × 10^5 serves as an estimate of the combination of
variables at which turbulence originates within the disk boundary
layer. For this spacing velocity data indicate K = 0.44, or that
the angular velocity of the fluid "core" relative to the rotor is
0.56ω. Writing R in terms of the relative angular velocity ω − β,
and assuming this value of K to be valid down to the point of
transition, the equivalent R for the transition to turbulence is
8.4 × 10^4, which is below the value of 2.8 × 10^5 for a free disk in
air as reported in [8]. Two differences between the flows of free
and enclosed disks may influence this early onset of turbulence.
With the enclosed disk the rotating fluid core between the rotor
and the stator imposes both a different degree of external turbulence
and a radial pressure on the disk boundary layer. The latter
subjects the radial flow to an adverse pressure gradient, while
the former should have an effect upon the transition since the
boundary layer is fed from the face of the rotating core to
compensate for the radial outflow.
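The equivalent transition Reynolds number can be written out as a short worked step. This sketch assumes the disk Reynolds number is defined as R = ωa²/ν and that the relative value simply replaces ω with ω − β = (1 − K)ω; the symbol R' is introduced here only for illustration.

```latex
R' = \frac{(\omega - \beta)\,a^{2}}{\nu} = (1 - K)\,R
   = (1 - 0.44)\times 1.5\times 10^{5}
   = 8.4\times 10^{4}
```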

Before proceeding to a quantitative evaluation of the ac- 
curacy of the simple theory expressions, it is in order to summarize 
the physical processes of the flow within the chamber, noting 
especially those regions in which the actual flow conditions are not 
in accord with the assumptions made in obtaining the numerical 
values. The major difference is that the theoretical expressions 
assume an abrupt transition from one regime to another over the 
entire face of the disk, whereas the point of transition moves over 
the surface at different Reynolds numbers. The model to be 
discussed is a disk of constant radius having an s/a such that all 
four regimes may exist. 

The simple close clearance theory for Regime I assumes no 
secondary flow. This approximation is evidently quite good at 
the lowest value of R; as R increases, however, a certain amount 
of secondary flow most likely arises, with the tangential-velocity 
profiles becoming more parabolic, leading to steeper velocity gradi- 
ents near the disk and thus higher torques. These effects are 
indicated in the analyses of [1, 4]. With further increases in R
the boundary layers separate and Regime II is reached, for which 
the radial flow has been considered. As R increases further the 
boundary layer of uniform thickness over the disk surface be- 
comes thinner, until the local critical Reynolds number is reached 
on the disk; the transition to turbulence occurs at the correspond- 
ing local radius. If s/a is sufficiently small, the turbulent 
boundary layers then formed merge to initiate Regime III. 
With still increasing R the transition point, which occurs first at 
the tip, moves inward until the merged boundary layers cover 
the entire disk surface. The analysis for Regime III takes no 
account of any radial flow, but since there is a definite radial flow 
in Regimes II and IV it is not reasonable to expect it to vanish 
entirely in Regime III. As R increases still further the boundary- 
layer thicknesses tend to decrease until the layers separate. This 
separation probably occurs first at the hub of the disk, and with 
further increases of R the point of boundary-layer convergence is 
swept outward and Regime IV is established over the entire sur- 
face. 
Fig. 13 Summary of smooth-disk torque data

Regime I existed only for s/a = 0.0127 and s/a = 0.0255. In
the former, agreement of data and simple theory equation (2) was 
very good; for s/a = 0.0255, values predicted by the theory are 
somewhat low, indicating a greater deviation from the simple 
tangential-velocity distribution at the higher values of R within 
this regime for wider axial spacings. 

For the four s/a cases in which Regime II was present, using
the theoretical exponent for R of −1/2 to establish the slope on a
log-log plot of C_m versus R, straight lines were drawn to obtain
the best fit to the experimental values at this slope. These 
straight lines adequately represented the test data, and the 
empirical coefficients thus obtained were compared with those 
from the approximate theory for the respective s/a geometry. 
The experimental values were the higher in all cases, the dis- 
crepancy ranging from 1.4 per cent at s/a = 0.0255 to O per cent 
at s/a = 0.217. Earlier equations (3) and (7) are thus seen to
be adequate for most calculations, but deviate further with
increasing s/a.

Previous equation (9), used for Regime III, cannot be expected 
to give close agreement with data since radial flows were neglected 
in its development. For s/a = 0.0127, predicted values are ap- 
proximately 15 per cent higher than those measured; for s/a = 
0.0255, predicted values are about 7 per cent high near the center 
of the regime, this discrepancy decreasing near the limits of the 
regime. 

For the three s/a spacings at which Regime IV was present,
comparison of test and predicted values was made in the same
manner as for Regime II. Empirical torque coefficients were 1.3
per cent higher than given by the approximate theory at s/a =
0.0255; at the widest spacing of s/a = 0.217, the theoretical value
was 3.3 per cent low. Earlier equation (5) is seen to be too low
for all spacings; values predicted by equation (8) approximate
s/a = 0.14.

On the basis of the torque data obtained, the following strictly
empirical relationships may be presented for Regimes II, III,
and IV:

Regime II:

$$ C_m = \frac{3.70\,(s/a)^{1/10}}{R^{1/2}} \tag{45} $$

Regime III:

$$ C_m = \frac{0.080}{(s/a)^{1/6}\,R^{1/4}} \tag{46} $$

Regime IV:

$$ C_m = \frac{0.0102\,(s/a)^{1/10}}{R^{1/5}} \tag{47} $$

These expressions agree with the experimental data to within 
about 1 per cent over the ranges covered. 
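The empirical expressions lend themselves to a small regime calculator. The sketch below is an illustration under several assumptions: the Regime I form 2π/[(s/a)R] is taken from the summary table; the exponents 1/10, 1/6, 1/4, and 1/5 are as read from the damaged print; and choosing the regime with the largest C_m is a shortcut adopted here in place of reading line intersections from the logarithmic plot. The Regime IV coefficient is deliberately left as a required argument. With the printed value 0.0102 the Regime II/IV intersection falls above R = 10^8, whereas 0.102 places it near R = 1.6 × 10^5, close to the transition estimate of 1.5 × 10^5 quoted in the text for s/a = 0.115. Which value is intended should be checked against the data before use.

```python
import math


def cm_by_regime(s_over_a, R, iv_coeff):
    """Torque coefficient predicted by each regime's empirical expression (assumed forms)."""
    g = s_over_a
    return {
        "I": 2.0 * math.pi / (g * R),
        "II": 3.70 * g ** 0.1 / R ** 0.5,
        "III": 0.080 / (g ** (1.0 / 6.0) * R ** 0.25),
        "IV": iv_coeff * g ** 0.1 / R ** 0.2,
    }


def governing_regime(s_over_a, R, iv_coeff):
    """Pick the regime with the largest C_m (a heuristic assumed for this sketch)."""
    values = cm_by_regime(s_over_a, R, iv_coeff)
    regime = max(values, key=values.get)
    return regime, values[regime]


def intersection_ii_iv(iv_coeff):
    """Reynolds number where the Regime II and IV lines cross.

    The (s/a)^(1/10) factor appears in both expressions and cancels.
    """
    return (3.70 / iv_coeff) ** (1.0 / 0.3)


if __name__ == "__main__":
    for coeff in (0.0102, 0.102):
        print(coeff, "-> II/IV intersection at R =", intersection_ii_iv(coeff))
    print(governing_regime(0.115, 1.0e6, iv_coeff=0.102))
```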

Velocity Data. Turbulent-flow velocity distributions in water
were obtained between rotor and stator at a value of R = 4.4 ×
10^6 for s/a-values of 0.0255, 0.0637, 0.102, and 0.217, at relative
radii r/a of 0.433, 0.599, and 0.765.

Fig. 15 Turbulent-flow velocity profiles, r/a = 0.765, s/a = 0.0637,

Fig. 14, for r/a = 0.765 at s/a = 0.0255, shows the existence of
Regime III. Regime IV existed at this Reynolds number for the
wider spacings; Fig. 15 is representative of the measured profiles
and indicates also the profiles predicted by the approximate
theory for this condition. Both the tangential and radial-velocity
components are plotted in a dimensionless form. Plotted results
are not extrapolated to either disk surface; the zone in which
maximum radial velocities occurred was so thin that it was possible
to measure the maximum radial component in only a few instances
with the 0.035-in-thick probe. The magnitude of the
tangential core velocity decreases with increasing s/a, in accord
with the approximate theory, as seen in the following tabulation
for Regime IV data:


            Measured     Predicted
s/a         K = β/ω      K = β/ω
0.0637      0.460        0.454
0.102       0.444        0.432
0.217       0.412        0.388

In all measured tangential-velocity profiles peaks in the profiles 
of relative tangential velocities occur near each of the two disk 
surfaces; in all cases, this peak velocity exceeds the relative 
velocity of the fluid outside the boundary layer. 

For s/a-values of 0.0255, 0.0637, and 0.102, including both 
Regimes III and IV, a radial outflow was measured over the gap 
between rotor and stator. This outflow is greater for the narrower 
spacings, average values v/wr for the three spacings being (ap- 
proximately) 0.02, 0.005-0.01, and 0.004, respectively. This net 
outward flow in the core region, added to the radial outflow in the 
boundary layer on the rotor, requires a correspondingly larger in- 
flow on the stator in order to satisfy continuity. Boundary-layer 
thicknesses on the stator are observed to be greater than those 
on the rotor. 

As indicated in Fig. 15, the predicted radial profile is not ac- 
curate. Measured boundary-layer thicknesses are greater than 
predicted. Appreciable variations in both radial and tangential 
velocities cease at essentially the same distance from the surface. 
The fact that the predicted boundary-layer thickness is within 20 
to 30 per cent of the distance from the surface to the point of 
maximum relative velocity, coupled with the good agreement be- 
tween measured and predicted K-values, leads to predicted tan- 
gential-velocity gradients near the rotor which agree well with 
the actual values. The reason for the agreement between theory 
and experiment with respect to frictional torque may now be 
recognized. 

Gap center-line, tangential-velocity readings only were taken
for laminar flow, all at r/a = 0.765 for R = 4.2 × 10^4, at the
s/a-values listed. Also listed are predicted values of K for Regime II
where this regime is applicable. No radial components were
measured, but radial-pressure distributions for these configurations
indicate a radial outflow through the central portion of the
gap.

            Measured β/ω
s/a         gap center line     Predicted
0.0510      0.46                0.504
0.102       0.44                0.460
0.217       0.36                0.386

As for the turbulent-flow case, the value of K decreases with
increasing s/a.

Pressure Data. Static pressure data were taken to verify hy- 
potheses based upon velocity data. These are here listed: 


1 The observed radial outflow in the gap between rotor and 
stator would imply that the radial pressure gradient does not 
balance centrifugal forces on the core fluid; it would then be ex- 
pected that actual radial-pressure gradients would be smaller 
than within a fluid mass rotating as a solid body with an angular 
velocity equal to that found by core-velocity measurement. 


2 An axial flow from stator to rotor must exist over all but the
tip regions of the disks. It would be expected that this flow
might be accompanied by an axial pressure gradient through the
central regions of the gap.

3 A pressure gradient might also be associated with the axial
flow within the boundary layer on the cylindrical wall.

Fig. 16 Turbulent-flow radial-pressure data, s/a = 0.0637


Pressures were measured on the cylindrical and stator walls for
both laminar and turbulent flows, and axial pressure traverses
were made at r/a = 0.765 for turbulent flow. Turbulent-flow,
radial-pressure data were obtained for Regimes III and IV, over
a range of Reynolds numbers spanning the value of R = 4.4 × 10^6
used in velocity runs; no apparent Reynolds-number effects were
observed, data for each s/a plotting consistently. Fig. 16 is a
representative plot, showing results for s/a = 0.0637. The
dashed line represents the pressures to be expected if the "core"
fluid rotated as a mass at an angular velocity β = Kω determined
from velocity measurements. The solid line is drawn through the
data points; an equivalent K-value may be determined from the
equation:

$$ h = \frac{\beta^2 r^2}{2g} = \frac{(K\omega r)^2}{2g} \tag{48} $$


For s/a = 0.0637 and 0.102, in Regime IV, radial pressure gradi- 
ents were approximately 90 per cent of those predicted by assum- 
ing a core rotation. For s/a = 0.0255, in Regime III, the meas- 
ured gradient was approximately 80 per cent of that predicted for 
core rotation at the average gap center tangential velocity. 
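The relation between piezometric head and an equivalent core-velocity ratio can be inverted directly. The following is a minimal sketch, assuming the solid-body form h = (Kωr)²/(2g) with h measured relative to the center of rotation and SI units; the function names and the sample values of ω and r are illustrative only.

```python
import math

G_STANDARD = 9.80665  # standard gravity, m/s^2


def head_solid_body(K, omega, r, g=G_STANDARD):
    """Piezometric head rise at radius r for a core rotating at beta = K * omega."""
    return (K * omega * r) ** 2 / (2.0 * g)


def equivalent_k(h, omega, r, g=G_STANDARD):
    """Core-velocity ratio that would produce the measured head h at radius r."""
    return math.sqrt(2.0 * g * h) / (omega * r)


if __name__ == "__main__":
    omega, r, K = 100.0, 0.2, 0.46  # illustrative values
    h_full = head_solid_body(K, omega, r)
    # A measured head at 90 per cent of the solid-body value implies
    # an equivalent K reduced by the factor sqrt(0.9).
    print(equivalent_k(0.9 * h_full, omega, r), K * math.sqrt(0.9))
```

Because the head varies as K², a pressure gradient at 90 per cent of the core-rotation value corresponds to an equivalent K only about 5 per cent below the velocity-based value.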

Fig. 17, also for s/a = 0.0637, is representative of the laminar- 
flow data. For this spacing and for s/a = 0.115, both in Regime 
II, measured radial gradients were only about 70 per cent of those 
predicted by assuming core rotation at the experimental center- 
line velocities. Pressure data at s/a = 0.0255, at which flow- 
regime transitions occurred, showed Reynolds-number effects; 
test points could not be plotted on a straight line as in Figs. 16 and 
17, and measured values were even farther from the predicted 
than for the clear Regime II case. These laminar-flow results 
also indicate a net outflow in the core regions. 

Axial pressure traverses across the gap at r/a = 0.765 for s/a
of 0.0637 and 0.102 at R = 4.6 × 10^6 indicated that in each case
the pressure measured closest to the stator was slightly higher
than that measured closest to the rotor. As in each case this
head difference was less than 1 per cent of the total static head
measured at the stator, the accuracy of this small indicated
pressure gradient, which is in the direction of the axial flow between
the disks, is uncertain.

Fig. 17 Laminar-flow radial-pressure data, s/a = 0.102

Axial pressure gradients along the cylindrical wall were deter- 
mined for different R-values for s/a = 0.115, using water, and 
s/a = 0.102, using oil; results are given in Fig. 18. All gradients 
have the same shape, indicating a decrease in pressure moving 
axially away from the rotor, with the gradient decreasing with in- 
creasing distance z. The dimensionless gradients are slightly 
greater for the turbulent-flow case. 


Conclusions 


The following general conclusions may be drawn from this 
study of the fluid mechanics associated with the rotation of a 
smooth, plane, enclosed disk. 


1 Attention has been focused on the existence of four possible 
modes of flow within the casing. These are: 


Regime I.    Laminar flow, merged boundary layers.
Regime II.   Laminar flow, separate boundary layers.
Regime III.  Turbulent flow, merged boundary layers.
Regime IV.   Turbulent flow, separate boundary layers.


The possible existence of these modes of flow depends upon the
Reynolds number-s/a combinations of the system. A summary
of the best empirical and analytical expressions for frictional
torque coefficients for each regime is shown in the following
table.

Regime   Best empirical equation                 Best theoretical equation
I        C_m = 2π / [(s/a) R]                    C_m = 2π / [(s/a) R]
II       C_m = 3.70 (s/a)^{1/10} / R^{1/2}       C_m = C / R^{1/2}, C = φ(s/a)
III      C_m = 0.080 / [(s/a)^{1/6} R^{1/4}]     C_m = 0.0622 / [(s/a)^{1/4} R^{1/4}]
IV       C_m = 0.0102 (s/a)^{1/10} / R^{1/5}     C_m = C / R^{1/5}, C = φ(s/a)


2 Additional verification was obtained for the simple close-clearance,
laminar-flow theory for friction torque in Regime I.

3 A momentum analysis of the von Karman type refined to
include cylindrical wall-friction effects has been presented for
Regimes II and IV and has shown:


(a) An accurate prediction of the form of the variation of the 
relative tangential velocity of the fluid in the central region of the 
axial gap with s/a. 

(b) An accurate prediction of the form of the variation of 
torque coefficient with R and s/a. 


4 For Regime III the form of the variation of torque co- 
efficient with R was found to agree with the theoretical analysis of 
Soo, but the variation with s/a deviated from that predicted in his 
analysis. 

5 It has been shown that an external indication of the flow 
regime in the casing is provided by the manner of the variation in 
disk-friction torque over a range of R. The limits of each regime 
for a particular s/a are approximately indicated by the intersections
of the several straight lines drawn according to consistent
sets of equations on a logarithmic plot. 
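The intersection construction in conclusion 5 can be sketched numerically. If two regimes are each represented by a power law $C_m = c\,R^{-n}$ at fixed s/a (a straight line on a logarithmic plot), the lines cross where the two expressions are equal. The function name and the sample coefficients below are hypothetical and chosen only for illustration; they are not values from the paper.

```python
def power_law_intersection(c1: float, n1: float, c2: float, n2: float) -> float:
    """Reynolds number at which c1 * R**(-n1) equals c2 * R**(-n2).

    Setting the two power laws equal gives R**(n1 - n2) = c1 / c2.
    """
    if c1 <= 0.0 or c2 <= 0.0:
        raise ValueError("coefficients must be positive")
    if n1 == n2:
        raise ValueError("parallel lines on a log plot never intersect")
    return (c1 / c2) ** (1.0 / (n1 - n2))


# Hypothetical laminar-like and turbulent-like lines at one fixed s/a.
r_limit = power_law_intersection(c1=2.0, n1=0.5, c2=0.05, n2=0.2)
print("approximate regime limit R =", r_limit)
```

For real use, the coefficients c would already contain the s/a dependence of the chosen pair of regime equations, so the limit shifts as s/a changes.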

6 For Regimes I, II, and IV the theoretical expressions repre- 
sent the experimental values quite closely. 

7 Velocity data indicate the following in Regime IV: 


(a) Boundary-layer thicknesses increase with radius on both 
the rotor and stator. 

(b) The average tangential velocity across the gap between
rotor and stator decreases with increasing s/a for a constant
Reynolds number.

(c) A small radial outflow exists across the central regions of 
the axial gap. 


8 Pressure data show the following: 


(a) For all cases tested, measured radial pressure gradients are 
less than those which would be obtained for a fluid core rotation 
with no radial-flow component. 

(b) In Regimes II and IV there is an axial pressure gradient at
the cylindrical wall, the pressure decreasing from rotor to stator.
For Regime IV, data show this gradient to be larger than, and in 
the opposite direction from, a very small gradient at intermediate 
radii. 


Acknowledgment 


This investigation was conducted at the Hydrodynamics 
Laboratory of the Massachusetts Institute of Technology as part 
of a program sponsored by the Office of Ordnance Research, U. S.


MARCH 1960 


Fig. 18 Axial-pressure data, cylindrical wall

Army, under Contract No. DA-19-020-ORD-3491. Acknowledg- 
ment is also made to Mr. W. C. Knowles, Jr., Mr. E. G. Altouney, 
and Mr. B. Glenne, who participated in the test program. 

The authors are indebted to Professor Schultz-Grunow for 
noticing a sign error in one equation in the preprinted version 
of this paper. The appropriate corrections to figures and text 
have been made in this final printing.


References 


1 F. Schultz-Grunow, “Der Reibungswiderstand rotierender
Scheiben in Gehäusen,” Zeitschrift für Angewandte Mathematik und
Mechanik, vol. 15, 1935, pp. 191-204.

2 A. T. Ippen, “Influence of Viscosity on Centrifugal-Pump
Performance,” Trans. ASME, vol. 68, 1946, pp. 823-848.

3 K. Pantell, “Versuche über Scheibenreibung,” Forschung auf
dem Gebiete des Ingenieurwesens, vol. 16, 1950, pp. 97-108.

4 S. L. Soo, “Laminar Flow Over an Enclosed Rotating Disk,”
Trans. ASME, vol. 80, 1958, pp. 287-296.

5 Th. von Kármán, “Über laminare und turbulente Reibung,”
Zeitschrift für Angewandte Mathematik und Mechanik, vol. 1, 1921,
pp. 244-249.

6 W. G. Cochran, “The Flow Due to a Rotating Disk,” Proceedings
of The Cambridge Philosophical Society, vol. 30, 1933-34, pp.
365-375.


7 J. W. Daily and R. E. Nece, “Roughness and Chamber Dimension
Effects on Induced Flow and Frictional Resistance of Enclosed
Rotating Disks,” Massachusetts Institute of Technology Hydrodynamics
Laboratory Technical Report No. 27, May, 1958.

8 T. Theodorsen and A. A. Regier, “Experiments on Drag of
Revolving Disks, Cylinders, and Streamline Rods at High Speeds,”
NACA Technical Report 793, 1944.


DISCUSSION 
Frank Kreith²


The authors have made a valuable contribution to our under- 
standing of flow phenomena of enclosed rotating disks. They 
were kind enough to make the results of their work available to 
me in a preliminary form last year when I was working on a re- 
lated problem under a grant from the National Science Founda- 
tion. Their courtesy has been of great help in this work, and the 
following remarks are based on the results of this study. 

The experiments referred to were concerned with heat transfer 
and flow phenomena of partially enclosed rotating disks, i.e., 
disks rotating between two parallel stationary walls without a 
circumferential enclosure [9].³ It was found that the flow and
torque characteristics in such a system differ radically from those 
observed with a completely enclosed rotating disk when the 
stationary walls are larger than the disk. In such a system the 
radial flow at any point in the gap changes direction periodically 
and appears to be similar in nature to certain stall phenomena ob- 
served in vaneless diffusers. Radial outflow is only continuous 
in a thin layer adjacent to the rotating disk where regularly spaced 
vortex rings, similar to those on a free rotating disk, were ob- 
served. 

As a result of these observations, it seems desirable to add the 
category of “partially enclosed rotating disks” to those of the
“free disk” and the “enclosed disk.” In this new category one
should distinguish between systems in which the stationary walls 
are equal to or smaller than the rotating disk and those in which 
the stationary walls are larger than the disk. In the former, the 
flow is similar to that observed by the authors in completely en- 
closed systems, but in the latter the flow pattern is periodic in 
nature. The periodic flow reversals have been investigated
experimentally and are described in detail in a paper submitted to
ASME [10].


² Professor of Aeronautical Engineering, University of Colorado,
Boulder, Colo. Assoc. Mem. ASME.

³ Numbers in brackets designate References at the end of this
discussion.
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In order to calculate the rate of heat transfer from a rotating 
disk by the integral method [11], the radial velocity component 
profile must be known. To perform such a calculation with
precision it would be helpful if the authors would measure in future
tests the maximum value of the radial velocity component and its
location, i.e., complete the turbulent velocity profile in Fig. 14,
and obtain a similar radial velocity profile in laminar flow. The
latter problem seems particularly important because, as shown by
Deak [11], available analytical solutions are not sufficiently com- 
plete to be applied to problems of practical importance in disk 
cooling. 
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9 F. Kreith, ‘Influence of Centrifugal Forces of Heat Transfer 
and Flow,” Final Report of NSF Research Program G-3067, Novem- 
ber, 1958. 


10 L. A. Maroti, G. Deak, and F. Kreith, “Flow Phenomena of
Partially Enclosed Rotating Disks,” ASME Paper No. 59-A-49, to
be published.


11 G. Deak, ‘Heat Transfer From a Disk Rotating Between Two 
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George Rudinger* 


Several years ago, Reiner [12]⁵ demonstrated an experiment
in which a flat disk was rotated about its axis opposite a fixed
disk. When the separation between the disks was about 0.5 mm,
the expected suction at the center of the disk was observed. 
With decreasing spacing, suction was reduced and below about 
0.02 mm (at 7000 rpm) the pressure became larger than the ambi- 
ent pressure. Reiner interpreted this observation as being 
analogous to similar observations with viscoelastic fluids [13] and 
concluded that air should be treated as a viscoelastic fluid. Later, 
Taylor and Saffman [14] pointed out that the observations could 
be explained on the basis of the conventional Navier-Stokes 
equations as the effects of extremely small, but unavoidable, im- 
perfections of the experimental equipment. 

This phenomenon could affect the results of experiments, such 
as those described in the present paper, if the spacing between the 
rotating and stationary elements were sufficiently small. There- 
fore, I should like to ask the authors whether any of their ex- 
periments showed evidence of the centripetal-pump effect. 
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12 M. Reiner, ‘‘A Centripetal Pump Effect in Air,"’ Proceedings, 
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13. K. Weissenberg, “‘A Continuum Theory of Rheological Phe- 
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14 Sir G. Taylor and P. G. Saffman, “Effects of Compressibility 
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24, 1957, pp. 553-562. 


H. Schlichting⁶ and W. Pechau⁶


This report represents a noteworthy contribution to the problem
of a disk rotating in a casing. This work can be regarded as
an improvement on the existing theories, since with the
help of a single treatment it leads to formulas for the frictional
moment for laminar and turbulent boundary layers, taking into
account the influence of the relative gap width s/a in all cases.

It is quite interesting that the theoretical treatment for
Regimes II and IV also contains the special case of a free-rotating
disk, although the flow model applied here for the case K = 0
does not accurately reproduce the flow conditions of a free disk.


⁴ Principal Physicist, Aerodynamics Research Department, Cornell
Aeronautical Laboratory, Inc., Buffalo, N. Y.

⁵ Numbers in brackets designate References at end of this
discussion.

⁶ Institut für Strömungsmechanik, Technische Hochschule,
Braunschweig, Germany.
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A further improvement of the theory could be achieved, if the 
boundary layer of the shaft would also be considered. This may 
give the possibility of prescribing the fulfillment of the continuity
equation as an extra condition, the neglect of which in this
work gives occasion to some serious considerations. By taking
into account the boundary layer on the shaft one has to deal with
re-entering boundary layers, the existence of which has been
pointed out by Grohne [15].⁷

Besides this it might be worthwhile to consider whether, for the
turbulent boundary layer, one should introduce the logarithmic
velocity law instead of the 1/7-power law in order to obtain
results which are applicable over a larger Reynolds-number
range. For the disk rotating in a free stream (no housing) this
has been done earlier by S. Goldstein [17]. A method for carrying
out the somewhat more difficult calculations has been given
recently by Dorfman [16].
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F. Schultz-Grunow® 


The authors are dealing mainly with the still unsolved question 
of how the axial clearance of the case influences the existing 
flow regimes and the frictional resistance of the disk. It is sup- 
posed that the friction on the cylindrical wall is the only influence. 
Therefore a boundary layer is introduced on this wall assuming 
the same boundary layer thickness as at the disk. 

Against the authors’ approximate theory exists the objection 
that the sign of the right-side term in the basic momentum equa- 
tion for the stator [see equation (14)] is wrong. This is the 
reason for the discrepancy between experiment and theory in 
authors’ Figs. 3(a), 3(b); 5(a), 5(b). However, revised calculation
gives good agreement with the authors’ and the discusser’s
earlier experiments.⁹ The authors also use simplified
moment-of-momentum equations concerning continuity [see equations (11)
and (15)], which give at the stator other boundary layer thicknesses
than those of the discusser. But this and the rather rough
assumption of a certain boundary layer thickness at the cylindri- 
cal wall seem to be of little influence on the friction coefficient 
according to authors’ extensive experiments. 


S. L. Soo¹⁰


This paper reports the most thorough experimental study of the
problem of the enclosed rotating disk. The inaccuracy of the
theoretical equation in Regime III can be attributed to the as- 
sumption that the core (hypothetical in this case) rotates at half 
of the disk velocity. Another interesting aspect of this paper is 
that the measured turbulent velocity distributions are quite dif- 
ferent from the boundary layer solution obtained by the momen- 
tum integral method. 


⁷ Numbers in brackets designate References at end of this
discussion.

⁸ Lehrstuhl für Mechanik, Technische Hochschule, Aachen,
Germany.

⁹ EDITOR’S NOTE: The sign error referred to appeared in the
preprint of this paper and affected the theoretical curves in some of the
figures. All appropriate corrections were made preceding the final
printing of the paper.

¹⁰ Associate Professor of Mechanical Engineering, Princeton
University, Princeton, N. J. Assoc. Mem. ASME.
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Fig. 19 Laminar velocity distribution according to Navier-Stokes equa- 
tion 


This experimental discovery actually applies to both laminar 
and turbulent boundary layers. Recent study by the discusser on 
the laminar motion over the rotating disk substantiates this 
fact. Based on an exact solution of the Navier-Stokes equation, 
velocity distributions can be expressed as: 


Tangential velocity: 
2 
u = u(y) +(°) Uy) 


Radial velocity: 
r\2 
v = + Ode) 


Axial velocity: 


r 
w = 
8 


(following notations in this paper). The trend is shown in Fig. 
19. At small radius, w, »,, and w; constitute sufficient descrip- 
tion of the motion, i.e., viscous slow motion (Regime I). wus, vs, 
w2, and still higher order terms become significant at larger 
radius, or the laminar boundary layer regime. The sum of various 
orders indicates the extent of approximation in the boundary 
layer solution. In other words, the core velocity does not have 
to be uniform in the tangential direction and zero in the radial 
direction. The pressure data in Fig. 17 substantiate this fact, 
since the inaccuracy of the boundary layer approximation is
magnified here.




On the other hand, the mean radial velocity can still be ac- 
counted for in the momentum integral method from consideration 
of rate of change of angular momentum of the core and in the 
boundary layers. 


Authors’ Closure 


As Professor Kreith points out, the partially enclosed disk is an 
intermediate category between the free and totally enclosed. 
His interesting experiments were conducted without circumferential
enclosure. This can be considered as the limiting case of
an enlarged torus-like circumferential housing, a configuration
that is encountered in many applications. It would be interesting
and important to learn how the periodicities obtained by Professor
Kreith develop as the simple enclosure is modified into the
circumferential torus, and the latter, in turn, is enlarged.

The authors agree with the desirability of knowing the com- 
plete radial velocity profile, both for turbulent and laminar flow 
cases. In future experiments it will be necessary to overcome the 
limitations due to relative probe size, as encountered during the 
present turbulent profile measurements. For laminar flow where 
either small gap dimensions or very low velocities must exist, the 
experimental difficulties are increased, and it will be difficult to 
improve on the results of theoretical computations such as 
those supplied by Professor Soo in his discussion. 

The centripetal-pump effect mentioned by Mr. Rudinger was 
considered since it was recognized that this could possibly affect 
the test results. However, the axial clearance s was in all cases 
much greater than that indicated by Taylor and Saffman [14] as 
being necessary to introduce appreciable effects. Hence these 
centripetal-pump effects, if any, were very small and were not 
observed. 

Messrs. Pechau and Schlichting comment that for Regimes II 
and IV the flow model for the case K = 0 does not accurately re- 
produce the flow condition of a free disk. The K = 0 case is 
actually meaningless for the enclosed disk system, as it is physi- 
cally impossible; on the other hand, the equations concerning the 
rotating disk alone do reduce to those for the free disk, as the 
resulting boundary conditions are equivalent. 

The authors agree that including an accurate description of the 
boundary layer on the shaft would be an improvement in the 
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theory. This was not done because of the negligible contribution 
of this boundary layer to the frictional torque. 

It is also agreed that the use of the logarithmic law to represent 
the velocity distribution would probably lead to an improvement 
in the theory at higher Reynolds numbers. On the other hand, 
over the test range covered in the experiments, neither the one- 
seventh power law nor the logarithmic law checked the measured 
velocities exactly, and there seemed little to choose between them. 

The authors acknowledge again the sign error noticed by Pro- 
fessor Schultz-Grunow in equation (14). As he points out, this 
correction leads to improved agreement between theory and ex- 
periment in Fig. 3, for Regime II; the agreement is slightly im- 
proved in Fig. 5, for Regime IV. As this error was in a radial 
momentum equation, further illustration is provided of how the 
tangential velocities overshadow the significance of the radial 
components in respect to torque considerations. 

The authors thank Professor Soo for his interesting addition 
to the theory. Two items of particular interest noted in the 
measured velocity profiles for the Regime IV case were: first, 
the existence of a small net radial outflow in the core region; 
second, the “humps” of maximum relative tangential velocities 
near both the rotor and stator. It is interesting that Professor 
Soo’s theory leads to the prediction of the first of these results 
for the corresponding laminar flow case with separate boundary 
layers, Regime II. 

It may be appropriate to comment here on the mechanism ex- 
plaining the existence of the maxima in the tangential velocity 
profiles, a point not elaborated upon in the original paper. 
These maximum tangential velocities relative to the stator and 
rotor occur within the zone of measurable radial flow, outside of 
the fluid core, over each of the circular surfaces. On the stator, 
the high tangential velocity indicates a fluid with an angular 
momentum comparable to that at a greater radius. In this fluid, 
being decelerated as it flows radially inward due to shear on the 
stator, there remains an excess of angular momentum near the 
outer edge of the boundary layer; the effect is that of the angular 
momentum “lagging” the local radius. The converse situation
exists on the rotor. The measured velocities are further expres- 
sions of the secondary motions which exist. 

In conclusion, the authors wish to thank the discussers for 
their consideration and for their thoughtful comments. The 
discussions form an appreciated supplement to the paper. 
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J. P. JOHNSTON 


Ingersoll-Rand Company, 
Phillipsburg, N. J. 


On the Three-Dimensional Turbulent Boundary
Layer Generated by Secondary Flow


A study of the secondary flow type of three-dimensional turbulent boundary layer is
presented. Two objectives are achieved: (a) A mathematical model of the relationship
between the cross-flow and main-flow components of the velocity vectors of the layer is
established. (b) By utilization of the model some of the relationships required to carry
out a boundary-layer problem solution by the use of the momentum-integral equations
are developed.


Introduction 


A PROBLEM THAT HAS INTERESTED DESIGNERS of turbomachinery
and fluid dynamicists concerns the nature of the
boundary layer on a flat wall under the influence of a turning main
flow.² Many practical problems involve flows of this type; for
instance, flow on the end wall bounding a compressor cascade.
Often the Reynolds number of the flow is high enough to insure 
that the boundary layer is fully turbulent. Thus the many well- 
known methods developed for three-dimensional laminar bound- 
ary layers offer little or no help to the designer. 

Boundary-layer problems of the type to be discussed are com- 
monly described as secondary-flow problems because the three- 
dimensional perturbations in the layer are caused by the radial 
pressure gradient imposed on the layer by the curvature of the 


main flow. The effect generally observed is a skewing of the
boundary-layer velocity vectors toward the center of curvature
of the main flow.

¹ Taken in part from ScD thesis, Mechanical Engineering Department,
Massachusetts Institute of Technology, Cambridge, Mass.,
June, 1957.

² Inviscid-flow region adjacent to the boundary layer.

Contributed by the Fluid Mechanics Subcommittee of the Hydraulic
Division and presented at the Hydraulic Conference, Ann
Arbor, Mich., April 13-15, 1959, of THE AMERICAN SOCIETY OF
MECHANICAL ENGINEERS.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, December
22, 1958. Paper No. 59-Hyd-6.

In this paper we shall discuss some recent developments in this 
field with the hope that the results presented will lead eventually 
to a useful method for computing the boundary-layer thick- 
nesses, wall shear stress, and separation tendencies of the sec- 
ondary-flow type of three-dimensional turbulent layer. The 
present paper is by no means complete in this respect nor is it 
meant to be so. 

With regard to the results to be presented, they should not be 
presumed to be applicable outside of the following restrictions: 


1 Flow is of the secondary-flow type.

2 Fully turbulent boundary layer.

3 Incompressible flow.

4 Flow over a flat surface (or one of small curvature) where
the usual boundary-layer assumptions are valid.

5 Steady, irrotational, and two-dimensional main flow.

6 Collateral³ turbulent-velocity profile upstream of the region
of turning.


In the discussion by Taylor [1]⁴ it is indicated that all of these
restrictions may not be necessary, particularly with respect to the 
conclusions concerning the relationship between the components 

³ Definition of a collateral velocity profile: a velocity profile in
which all velocity vectors lie in a flat plane.

⁴ Numbers in brackets designate References at end of paper.


Nomenclature 


A = a parameter
c = magnitude of velocity vector in boundary layer
$C_f = \tau_0 / (\tfrac{1}{2}\rho U^2)$ = skin-friction coefficient
$C_{fx} = \tau_{0x} / (\tfrac{1}{2}\rho U^2)$ = skin-friction coefficient in x-direction
$\epsilon = \tan \gamma_w$ = a parameter
f(-) = indicates a function of (-)
G = a function of $(y/\delta)$
g = a function of $(y/\delta)$
$H_x = \delta_x / \theta_x$ = shape factor
K(x) = a function of (x)
p = static pressure
$R_{\theta x} = U\theta_x / \nu$ = Reynolds number based on $\theta_x$
U = main-stream velocity magnitude
u, v, w = time mean components of boundary-layer velocity vector, in system related to main-flow streamlines, Fig. 1
$(u/U)_p$ = value of u/U at junction point of model, Fig. 7
uy = velocity magnitude in a hypothetical two-dimensional boundary layer
x, y, z = streamline co-ordinate system, Fig. 1
$y_p$ (and its nondimensional form) = dimensional and a nondimensional form of y at junction point of model
$\alpha$ = main-flow-streamline angle relative to a fixed direction (measured − in a clockwise direction in Fig. 1)
$\gamma$ = angle c makes with respect to U, Fig. 7
$\gamma_w$ = angle of limiting wall streamline and $\tau_0$ with respect to U, Fig. 5
$\delta$ = boundary-layer thickness
$\delta_x, \delta_z, \theta_x, \theta_z, \theta_{xz}, \theta_{zx}$ = displacement and momentum thickness integral parameters
$\nu$ = kinematic viscosity
$\rho$ = density
$\tau_0$ = shear stress at wall
$\tau_{0x}, \tau_{0z}$ = x and z-components of $\tau_0$
$\tau$ = sum of viscous and Reynolds’ stresses
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of the velocity vectors in the boundary layer. However, for pur- 
poses of this paper the restrictions should be kept in mind. 

The basis of all present methods for treating boundary-layer 
problems is the set of boundary-layer equations which for 
turbulent layers must be developed into the momentum-integral 
equations. 

In the case of the two-dimensional turbulent layer only moder- 
ate analytical success has been achieved by use of the momentum 
integral equations. The many momentum-integral computation 
techniques so far developed depend on one or another correlation 
scheme for relating the experimental velocity profiles to the 
parameters involved in the integral equations (pressure gradient, 
wall shear stress, momentum thickness Reynolds number, and so 
on). Experimentally determined laws of the wall shear stress are 
also required. 

Here, by a critical review of velocity-profile data, an attempt
is made to establish some of the empirical information required
to carry out solutions of the momentum-integral equations for a
secondary-flow type of three-dimensional turbulent boundary
layer.


The Co-Ordinate System and Equations 


For the purposes of this paper we have chosen to treat the 
problem in the streamline co-ordinate system. The principal 
reason for this choice is that, when experimentally determined 
velocity profiles are viewed in this system, certain regularities are 
discerned. Also, when the momentum-integral equations are 
written in the streamline co-ordinate system, physical meaning 
may be assigned more readily to the terms of the equations. 

The streamline co-ordinate (x, y, z) system and velocity components
(u, v, w) of the boundary layer are shown in Fig. 1. The
system is a right-handed, orthogonal system related to the streamlines
of the main flow and their orthogonal trajectories. For the
purposes of the equations considered here, a reference streamline
and trajectory are chosen and unit lengths are determined along
the reference lines so that the factors $h_x$ and $h_z$ involved in the line
element $ds$ are unity.

$$ds^2 = h_x^2\,dx^2 + dy^2 + h_z^2\,dz^2 \qquad (1)$$

At a point, the curvature of a streamline is expressed as $\partial\alpha/\partial x$
and that of the orthogonal trajectory as $-\partial\alpha/\partial z$.
In the main flow Euler’s equations are: 


- U— + -=0 (3) 


Oe 


In the boundary layer one obtains the boundary-layer equa- 
tions (see Timman [2]) in the form 


ox oy oz ox oz 


u 
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Reference Direction 


Fig. 1 Streamline co-ordinate system (main-flow streamlines and their orthogonal trajectories; the y-axis points out of the paper)


$$0 = -\frac{1}{\rho}\frac{\partial p}{\partial y} \qquad (7)$$


The continuity equation in the boundary layer is 


oa + + Ou + ov + ow 0 (8) 
oz ox ox oy Oz 


Using all of the foregoing expressions, one can integrate the 
boundary-layer equations with respect to distance y from the 
wall and obtain the momentum integral equations in the form 
28, , 


+ 


ou 
or dz 


l 
9 
+ (26, + 6,) U 


in the x-direction, and


in the z-direction. 
The six integral quantities (or thicknesses) are 


é, f (U — u)du 0, = | 
0 0 
wer 
U way U2 vr (11) 
| 
ae (U d 6 J 2 1 
(U — u)udy wudy 
0 0 


All of these quantities are finite, but not all of them are inde- 
pendent. Three of them are linked by the equation, 


6, 


6, = 6, (12) 
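To make the integral thicknesses concrete, the following sketch evaluates the streamwise pair numerically for a simple collateral profile. It assumes the usual forms $\delta_x = \int_0^\delta (1 - u/U)\,dy$ and $\theta_x = \int_0^\delta (1 - u/U)(u/U)\,dy$, and it uses a one-seventh-power profile purely as an illustrative input; the function names are hypothetical, and the cross-flow thicknesses are not treated here.

```python
def profile_thicknesses(u_over_U, delta: float, n: int = 20000):
    """Midpoint-rule displacement and momentum thicknesses.

    u_over_U : callable giving u/U as a function of eta = y/delta
    delta    : boundary-layer thickness
    Returns (displacement thickness, momentum thickness), same units as delta.
    """
    dy = delta / n
    displacement = 0.0
    momentum = 0.0
    for i in range(n):
        eta = (i + 0.5) / n          # midpoint of the i-th strip
        f = u_over_U(eta)
        displacement += (1.0 - f) * dy
        momentum += (1.0 - f) * f * dy
    return displacement, momentum


def seventh_power(eta: float) -> float:
    """Illustrative turbulent profile u/U = (y/delta)**(1/7)."""
    return eta ** (1.0 / 7.0)


disp, mom = profile_thicknesses(seventh_power, delta=1.0)
shape_factor = disp / mom
print(disp, mom, shape_factor)
```

For this particular profile the integrals can also be done by hand, giving $\delta_x/\delta = 1/8$, $\theta_x/\delta = 7/72$, and a shape factor of 9/7, which is a convenient check on the numerical result.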


It is noticed in the momentum-integral equations, Equations 
(9) and (10), that the term da/dz is retained in the equations. 
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Since the main flow is assumed to be two-dimensional, da/dz may 
be represented by the term —(1/U)(0U/dz). Nevertheless, it is 
useful to retain the da@/dz form because, for instance, when 
da/dz, dU /dz, 8,,, 8,, 5,, Tos, and their derivatives are set equal to 
zero the integral equations reduce to the single momentum in- 
tegral equation for the two-dimensional flow problem: 


+ (20, + 6,) 4 

ox U pU? 
In the foregoing equation the main flow can be two-dimensional 
(with respect to the plane of the wall) as assumed only if the term 
oU /dz equals zero or if flow is added to, or subtracted from, the 
main flow along the z-direction. On the basis of similar reasoning, 
it can be shown that equations (9) and (10) are valid for cases 
where change of mass flow is artificially produced through any 
elemental volume of the main flow. Under such circumstances 
(1/U)(0U/dz) is independent of da/dz and 0a/dz is interpreted 
as the rate of convergence or divergence of the main-flow stream- 
lines (Oa@/dz is negative for divergence). Under many circum- 
stances the afore-mentioned concept is useful. 

One may establish in brief the requirements for carrying out a 
solution using the momentum-integral equations. Since only the 
main-flow quantities U’, 0U’/dz, OU’ /dz, da/dz are considered as 
known, the quantities 6,, 6,, 6,, Tox, e, and two of the quantities 
6,, 9.., 9,, are considered as unknowns. It is possible in principle 
to formulate a solution for known initial conditions of the seven 
unknown quantities. However, one needs five auxiliary relations 
in addition to the two momentum-integral equations. The main 
objective of this paper is to establish the reasoning which led to 
the development of three auxiliary equations linking the three 
independent cross-flow, integral parameters, 6,,, 6,, 6,, to the 
other parameters of the problem, and to point out a shear-stress 
law that should allow calculation with the momentum-integral 
equations if one additional relation can be established. 


Experiment 


The conclusions and results to be developed here are based 
upon three sources of information [3, 4, 5]. All represent cases
where a well-developed, collateral, turbulent boundary layer be- 
comes three dimensional under the influence of a turning main 
flow. These are (to the knowledge of the author) the only sources 
of data of the desired type that are complete enough, accurate 
enough, and consistent enough to allow correlation and com- 
parison. 


Fig. 2 Experimental arrangement of Gruschwitz [3]. Notes: (a) numbered dots give approximate positions of measuring stations (data in this paper); (b) dimensions in centimeters.


Journal of Basic Engineering 


The Experiment of Gruschwitz [3]. This experiment was performed
in a turning passage of rectangular cross section preceded
by a straight rectangular duct in which a turbulent boundary 
layer had been caused to grow, Fig. 2. The value of Re, at the be- 
ginning of the turn was approximately 10%. At many stations on 
one of the flat walls, velocity profiles were determined by means
of a traversed cylindrical, three-holed, stagnation-pressure and
angle probe. Static pressures were determined by means of
wall-static taps.

The Experiments of Kuethe, McKee, and Curry [4]. These involved 
measurements of the three-dimensional turbulent boundary layer 
on a yawed wing of elliptical planform set up in a low-speed wind 
tunnel, Fig. 3. The Reynolds number, based on the root chord of 
18 in., was 7 X 10°, and Rg, fell in the range of 2 to 5 XK 10? at the 
measuring stations on the wing. Velocity profiles were measured 
by hot-wire anemometry. 

The Author's Experiment [5]. This experiment was carried out 
over the flat wall bounding a two-dimensional air jet forced to 
flow against a perpendicular back wall. A uniform, collateral, 
turbulent boundary layer was caused to grow on the walls of the 
rectangular entry duct, Fig. 4. The value of Rg, at the inlet 
plane was 5.5 X 10 and the layer was about 1.1 in. thick at this 
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point. Measurements of velocity profiles were made on and 
around the plane of symmetry at many stations upstream of the 
separated regions that occur on the test-section walls ahead of 
the back wall. A three-holed cobra probe⁵ traversed perpendicularly
to a test-section wall was used to determine flow angles and
stagnation pressures. Static pressures were determined at wall 
taps. Pressures were indicated on galvanometers by the use of a 
multiranged set of Statham Laboratories (models P5TC-0.2D-350 
and P97TC-0.05D-350) unbonded strain-gage pressure transducers
which were calibrated against a Foxboro micromanometer
having an uncertainty level of ±0.001 in. of H₂O. The uncertainty
level in pressure differential could be reduced to as low as
±0.005 in. of H₂O.

An additional set of data, that of Moore and Richardson [6], 
has been used to help demonstrate the relationship of the parame- 
ter A to theory (see following). These data, unfortunately, could 
not be used in their entirety due to the peculiar inlet conditions 
established by the authors. 


Choice of Model 


Velocity-component profiles generally look like those sketched 
in Fig. 5. The conventional method of studying crossflows in 
turbulent boundary layers has been to study crossflow w and 
main flow u components as functions of distance from the wall y. 
However, as we shall try to demonstrate, the study of velocity 
profiles as functions of y does not lead to as fruitful results as 
studying w as a function of u and the parameters e and A. As
will be shown, A is a parameter closely related to the shearless
nature of the flow and e is closely related to the frictional
character of the flow.⁶

To the author’s knowledge only two general models of turbulent 
velocity profiles of the type we are discussing have previously 
been proposed. 


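For the first general model Prandtl [7] suggests the assumption

$$\frac{u}{U} = G\left(\frac{y}{\delta}\right) \tag{13a}$$

$$\frac{w}{U} = e\,g\left(\frac{y}{\delta}\right) G\left(\frac{y}{\delta}\right) \tag{13b}$$

where G and g are universal functions of $y/\delta$ and are in general
only restricted by the boundary conditions:

at $y = 0$: $G = 0$, $g = 1$

at $y = \delta$: $G = 1$, $g = 0$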

If this model is valid, one would expect that the function
$w/(eU) = gG$ is a universal function of $y/\delta$. Mager⁷ presents some
data of Gruschwitz [3] and concludes that gG is a universal function
of $y/\delta$. Fig. 6 shows the region in which Gruschwitz’s data fall plus
some data from reference [5]. The data shown give a typical
spread. It must be concluded that gG is not a universal function
of $y/\delta$. As a matter of fact, the description of the main-flow
profile by the function G is certainly not acceptable in the light of
our present-day ideas of turbulent boundary layers in
two-dimensional flow.⁸

⁵ Probe tip consisted of three 0.024-in-OD, thin-walled, nickel
hypodermic tubes soldered together. The center tube indicated
stagnation pressure. The pressure difference across the outer tubes
(cut back at 60-deg angles) was nulled by rotating the probe, thereby
giving an angle measurement. The tip of the probe was 1/2 in. ahead
of any stem effects.

⁶ As shown in Fig. 5, $\gamma_w$ is the angle that $\tau_0$ makes with U;
$e = \tan \gamma_w$. It is pointed out in reference [5] that
three-dimensional separation on a flat wall may occur where:
(a) $\tau_0$ goes to zero; (b) limiting wall streamlines tend to become
tangent to each other. Since $\gamma_w$ (or e) is related to the direction
of limiting wall streamlines it becomes an important parameter when
considering separation in three-dimensional flows.

⁷ Reference [8], Fig. 3.

Mager [8] suggests specific functions for G and g. He assumes
the two power functions

$$G = \left(\frac{y}{\delta}\right)^{1/n} \tag{14a}$$

$$g = \left(1 - \frac{y}{\delta}\right)^{2} \tag{14b}$$

As Mager points out, these functions will not always give a good
representation of a three-dimensional turbulent boundary layer, but
they do seem to represent Gruschwitz’s [3] data fairly well for a
value of n = 7.

⁸ See Coles [9] and Clauser [10], who hold that turbulent
two-dimensional profiles cannot be described accurately as a universal
function of a single variable such as $y/\delta$.

Using equations (14a) and (14b) the so-called shape parameter
may be expressed as:

$$H_x = \frac{2 + n}{n}$$

Similarly, the ratios of the other momentum-integral quantities to
$\theta_x$ are functions of n and e. If it is assumed that n is a constant,
the ratios are proportional to e or its square. For instance, if
n = 7, the ratios linear in e take values such as 0.542e and 2.67e,
and the ratio quadratic in e takes the value 1.13e².
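The n = 7 constants quoted above can be checked numerically. The following is a minimal sketch, not part of the original paper: it integrates Mager's power-law profiles, $G = (y/\delta)^{1/n}$ and $g = (1 - y/\delta)^2$, in closed form with the Euler beta function. It assumes unsigned integral definitions (crossflow displacement $\int (w/U)\,dy$, mixed thickness $\int (1 - u/U)(w/U)\,dy$, crossflow momentum $\int (w/U)^2\,dy$); the paper's own symbols and sign conventions are defined elsewhere and may differ. The function names are illustrative.

```python
from math import gamma


def beta(a, b):
    """Euler beta function B(a, b) = integral of t^(a-1) (1-t)^(b-1) over [0, 1]."""
    return gamma(a) * gamma(b) / gamma(a + b)


def mager_ratios(n):
    """Integral ratios for G = eta^(1/n), g = (1 - eta)^2, eta = y/delta.

    Returns (H, mixed/e, displacement/e, cross_momentum/e^2), where every
    thickness is divided by the main-flow momentum thickness.
    Unsigned definitions are assumed (see the lead-in).
    """
    a = 1.0 / n
    delta_x = 1.0 - beta(a + 1, 1)                  # int (1 - G) d(eta)
    theta_x = beta(a + 1, 1) - beta(2 * a + 1, 1)   # int G (1 - G) d(eta)
    disp_z = beta(a + 1, 3)                         # int g G d(eta)
    mixed = beta(a + 1, 3) - beta(2 * a + 1, 3)     # int (1 - G) g G d(eta)
    mom_z = beta(2 * a + 1, 5)                      # int (g G)^2 d(eta)
    return (delta_x / theta_x, mixed / theta_x,
            disp_z / theta_x, mom_z / theta_x)


H, mixed, disp, mom = mager_ratios(7)
print(f"H = {H:.4f}  (2 + n)/n = {9 / 7:.4f}")
print(f"mixed = {mixed:.3f} e, displacement = {disp:.2f} e, "
      f"cross momentum = {mom:.2f} e^2")
```

Under these assumed definitions the three quoted constants correspond to the mixed, displacement, and crossflow-momentum integrals, in that order.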


Mager used relationships like the foregoing, with slightly different 
constants, together with an empirical shear-stress law and the 
momentum-integral equations in a calculation scheme. His 
calculation of the distribution of integral quantities for Gru- 
schwitz’s bend case shows good agreement with experiment. 
Nevertheless, this simple model would be unsatisfactory for com- 
puting a flow that had large changes in velocity-profile shape (for 
instance near separation). Gruschwitz’s data cover only a very 
small region of possible skin frictions, shape factors, and so on, 
and therefore are rather limited in nature. 

Moore and Richardson [6], after studying their data, retained
the general profile assumption of Prandtl but found that G and g
could be better expressed in the form

$$G = G\left(\frac{y}{\theta_x}, H_x\right) \tag{15a}$$

$$g = \left(1 - \frac{y}{\delta}\right)^{r} \tag{15b}$$

The main-flow-profile function G is similar to the formulation of
von Doenhoff and Tetervin [11] for two-dimensional boundary
layers. Although Moore's model is not as restrictive as the
power-law profiles of Mager, it still represents a universal function
of a single variable $y/\theta_x$, where $H_x$ replaces the parameter n
used by Mager as the function parameter. This formulation is
also a little more general than Mager’s in that a variable r
replaces the constant exponent 2 in the function for g.

In brief, Moore [6] was able to achieve better agreement with
the experiments of Gruschwitz [3] than did Mager [8]. He was
also able to achieve fair agreement with the data of reference [4].
To represent the function G, Moore used the profiles suggested
by von Doenhoff and Tetervin [11] and in general the u/U
profiles agreed quite well with experiment. The cross-flow
profiles w/U did not show such good agreement. Finally, Moore
found that he had to let r vary in order to get the best fit to the
data. With $H_x$, r, and $\theta_x$ as unknown variables as well as e and
$\tau_{0x}$, three auxiliary equations are needed in addition to the
momentum-integral equations. One might be able to borrow an
equation for $H_x$ and $\tau_{0x}$ from two-dimensional methods. However,
this model does not seem to give any clue as to the third
equation because of difficulties in relating the exponent r to
dynamic parameters of the problem.

It is the author’s opinion that most of the difficulty with the 
general model of Prandtl [7] and its special forms [8] and [6] 
is due to the basic profile assumption which not only fails to fit 
the available data but by its very nature does not allow us to
relate it easily to the physical influence of the main flow on the
boundary layer. For instance, one knows from simple shearless
secondary-flow theory that the angle of turn $\alpha$ of the main flow is
the major parameter in the reorientation of vorticity vectors in
the boundary layer, and the development of crossflow is a direct
result of the reorientation of the vorticity [12]. In the model
assumption of Prandtl it is very difficult to see how the angle $\alpha$ is
related to the model.

The second general model has been proposed recently by Coles
[9]. On the basis of his well-established correlation of two-dimen- 
sional turbulent-velocity profiles, Coles generalizes his results by 
proposing that the universal wall component of the profile may be 
added vectorially to the universal wake component to establish 
the profile shape of a skewed turbulent-velocity profile. This 
procedure was carried out with success for some of the velocity 
profiles of Kuethe, et al. [4]; however, Coles felt that the Gru- 
schwitz [3] data contained errors that would not allow their use 
in correlation. The data of the author [5] were not available to 
Coles at the time of his writing. However, the author attempted 
some preliminary correlation by Coles’ method with little success. 
It is felt that, due to the relatively small skewing of the profiles of
reference [5], the data would not have provided significant results,
particularly with respect to the law-of-the-wake component.⁹ At
the present time not enough data are available to establish Coles’
yawed turbulent boundary-layer hypothesis firmly.

A third general model is proposed here.¹⁰ In this model one
accounts for an apparent consistency in the behavior of the
cross-flow component w as a function of the main-flow component u.
In its most general form the model may be expressed by the
functional equation

$$\frac{w}{U} = f\left(\frac{u}{U}, e, A\right) \tag{16}$$

which says that w/U can be expressed as a function of u/U, the
familiar parameter e, and a parameter A. It will be demonstrated
that A is related to $\alpha$, the main-flow turning angle.

It should be noted that dependency of w/U on distance from 
the wall has, in this formulation, been discarded. The y depend- 
ency is implicit in the fact that u/U must be described as a func- 
tion of y to allow a complete profile description. In the present 
paper no attempt is made to describe this relationship completely 
but a simple assumption in the region near the wall allows the de- 
velopment of useful information. 


[Fig. 7: polar sketch marking region (I), region (II), and the locus
of the tip of the boundary-layer velocity vector]


The specific functional form of equation (16) suggested by the
data is shown in Fig. 7. This sketch represents a polar plot
(projected on the plane of the wall) of the locus of the tip of the
boundary-layer velocity vector c. At u/U = 1 one is at the
main flow and at u/U = 0 one is at the wall. The flow is broken
into two regions, divided at $(u/U)_p$. In the region (I) near the
wall

$$\frac{w}{U} = e\,\frac{u}{U} \tag{17}$$

and in the region (II) near the main flow

$$\frac{w}{U} = A\left(1 - \frac{u}{U}\right) \tag{18}$$

With these forms, w/U fits the main-flow and wall boundary
conditions of zero velocity. We shall call the present model the
“triangular model” for the purposes of this paper.

⁹ In a recent private communication to Professor Coles, the author
submitted his original data for further study.

¹⁰ This model was presented by Taylor [1] in his survey paper.
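The two straight lines of the triangular model can be sketched as a piecewise function of u/U. This is a minimal illustration rather than code from the paper: the function names are hypothetical, and the junction value of u/U is obtained simply by equating w = eu with w = A(U − u), which requires e/A > −1.

```python
def junction_u_over_U(e, A):
    """u/U at which the region (I) line w = e*u meets the region (II)
    line w = A*(U - u). Assumes A != 0 and e/A > -1."""
    return 1.0 / (1.0 + e / A)


def crossflow(u_over_U, e, A):
    """w/U from the triangular model for a given u/U between 0 and 1."""
    if not 0.0 <= u_over_U <= 1.0:
        raise ValueError("u/U must lie between 0 and 1")
    if u_over_U <= junction_u_over_U(e, A):
        return e * u_over_U          # region (I), near the wall
    return A * (1.0 - u_over_U)      # region (II), near the main flow


# Illustrative parameter values only (not taken from the data).
e, A = 0.2, 0.6
for u in (0.0, 0.25, 0.5, 0.75, 0.9, 1.0):
    print(f"u/U = {u:.2f}  w/U = {crossflow(u, e, A):.3f}")
```

Plotting w/U against u/U for these values traces the triangle of the polar plot, with its apex at the junction point.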


Experimental Confirmation of the Triangular Model 


Presented in Fig. 8 are the data of the author’s experiment [5]
in the form of the polar plot. Fig. 9 shows the data of Gruschwitz
[3], and Fig. 10 shows the data of Kuethe, et al. [4].

Two straight lines representing the model assumption are 
drawn for each set of data in Figs. 8, 9, and 10. The line on the 
right-hand side is the best straight line that could be drawn 
through the data in the outer part of the boundary layer, region 
(II). From this line experimental values of the quantity A were 
determined. The other straight line represents the model in re- 
gion (I), near the wall. 

Since there are few data showing in region (I), some explanation 
of the method of determining the line in region (I) is necessary. 
The author’s data [5] and the data of Gruschwitz [3], Figs. 8 and 
9, show the last measurable points very near the peak of the polar 
plot. One way to determine the line in region (I) is to extrapolate
angle measurements to the wall. In the case of Fig. 9, the line in
region (I) was determined from the value of $\gamma_w$ determined by
Gruschwitz by extrapolation. In the case of the author’s data,
Fig. 8, angle extrapolation tended to give values of $\gamma_w$ that seemed
too large by a few per cent. A careful examination of the two or
three data points nearest the wall for some of the data indicated
that the flow angle $\gamma$ remains constant for small distances from the
wall; therefore, it was felt that the best determination of $\gamma_w$ was
obtained by putting the straight line of the model through the origin
and one or two of the points of lowest u/U. As far as the data
of reference [4] (Fig. 10) are concerned, enough points occur in
region (I) so that the line could be faired through them with little
difficulty in most cases.

As mentioned before, very few data exist in region (I). The 
reason for this is that region (I) generally seems to exist only 
close to the wall. The thickness of region (I) was found to be less 
than a few per cent of the total boundary-layer thickness in all 
cases. This observation will be discussed in detail later since it is 
used to form an important assumption about the model. 


Flow Region (II)


Discussion of Parameter A. Mager, in a discussion of reference
[3], indicated that equation (18) may be proper in the region of
the boundary layer near the main flow.

Fig. 8 Polar plots of data of Johnston [5]

As Mager points out, it is reasonable to expect shearless
analysis to yield valid results in the outer part of boundary layers
which are well developed before being perturbed by the turning 
main flow. Under such circumstances, crossflows are developed 
as a result of the reorientation of the vorticity in the layer [12]. 
Of course, fluid friction comes into play but as far as the crossflow 
is concerned the effects of friction are felt only near the wall. As 
the flow proceeds through a turn, frictional effects propagate out 
through the crossflow. However, compared to the initial bound- 
ary-layer thickness the frictional sublayer in the crossflow is rela- 
tively thin for sharply turned flows. 

Mager’s analysis is reviewed in the Appendix and extended to
cases where there are pressure gradients along main-flow
streamlines. Mager found, for the case where
$\partial U/\partial x = \partial\alpha/\partial z = 0$, and by assuming $w \ll u$, that he could
arrive at a solution for w as a function of u, U, and $\alpha$ by using
the boundary-layer equations when shear-stress terms were
neglected. His solution is


$$\frac{w}{U} = \alpha\left(\frac{u}{U} - \frac{U}{u}\right) \tag{19}$$

where $\alpha$ is measured relative to the flow direction at the beginning
of the turn (along a main-flow streamline).


In the outer part of a turbulent boundary layer $u \approx U$ so that
one may rewrite equation (19) in the approximate form

$$\frac{w}{U} = -2\alpha\left(1 - \frac{u}{U}\right) \tag{20}$$

It is seen that equation (20) has the same form as equation
(18) (derived from observation of data) if one lets

$$A = -2\alpha \tag{21}$$


Note that equation (20) has been derived by many workers in 
the field of secondary flow in bends by consideration of the re- 
orientation of the vorticity in the boundary layer when it passes 
through a turn under circular-arc, main-flow streamlines [12]. 

In the extension of Mager’s analysis (Appendix) it was assumed
that w = A(U − u) and a solution for A was sought for
cases where $\partial U/\partial x \neq 0$ and $\partial\alpha/\partial z \neq 0$. The following
differential equation valid along a main-flow streamline is derived in
the Appendix:


U 
or oz or or 
If the main flow is two dimensional it can be shown that 
al da 
- = U 23 
or oz 


Substituting equation (23) into equation (22) it is seen that 


A la 
a(4) = -3 (24) 


Integration of equation (24) along a main-flow streamline from 
a=A = 0toa gives 


aq 
A= f (25) 
0 


It is seen that $A = -2\alpha$ for the case where U equals a constant
along a streamline.

Also note that A is only a function of main-flow quantities. 
It is therefore calculable and may be considered a known quantity 
as far as the boundary layer is concerned. 
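For the constant-speed case only, the parameter A and the resulting outer-region crossflow follow directly from the turning angle. The sketch below is illustrative and not from the paper: the function names are hypothetical, it takes the turning angle in degrees and converts to radians, and it does not attempt the general case in which U varies along the streamline.

```python
import math


def A_constant_speed(alpha_deg):
    """A = -2*alpha, with alpha in radians, for constant U along a streamline."""
    return -2.0 * math.radians(alpha_deg)


def outer_crossflow(u_over_U, alpha_deg):
    """w/U in the outer region, w/U = A (1 - u/U), for constant U."""
    return A_constant_speed(alpha_deg) * (1.0 - u_over_U)


# Illustrative turning angles only.
for alpha_deg in (10, 20, 30):
    print(f"alpha = {alpha_deg:2d} deg  A = {A_constant_speed(alpha_deg):+.3f}  "
          f"w/U at u/U = 0.8: {outer_crossflow(0.8, alpha_deg):+.3f}")
```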

Figs. 11 and 12 show a comparison between theoretical and ex- 
perimental values of the parameter A. Theoretical values were 
obtained using equation (25) and measured values of U and $\alpha$.
In the case of the data of reference [6] the values of w/U plotted
against u/U to get experimental values of A were the actual
values of w/U less the values of w/U at the beginning of the turn
(where $\alpha$ = 0).

Fair agreement is achieved between theory and experiment in
some cases, but good agreement is achieved in the majority of
cases.


Flow Region (I)

Parameter e and Skin-Friction Coefficient. In the limit, as one
approaches the wall, it is assumed that the angle which a
boundary-layer velocity vector has with respect to the main flow
approaches the angle $\gamma_w$ which the wall shear-stress vector has
with respect to the main flow. In other words,

$$\lim_{y \to 0} \frac{w}{u} = \tan \gamma_w = e$$

This assumption forms the basis for the assumption that equation
(17) is a reasonable approximation in the region (I).

As long as the region (I) is very thin relative to the total 
boundary-layer thickness (as the data indicate), the approxima- 
tion expressed by equation (17) should not be too far from the 
truth. 

Unfortunately, there is no simple theory that may be applied to 
the prediction of the parameter e in terms of quantities of the 
main flow only. Thus e must be considered as an unknown parameter
in the problem as compared to the parameter A which, as shown
in the preceding section, depends on the main-flow turning angle
$\alpha$ and velocity U.

The triangular-model assumption in the region (I) has some 
interesting consequences with respect to current two-dimensional, 
boundary-layer theories, which lead to a method of obtaining the
wall shear-stress coefficient $C_f$ from the velocity-profile
measurements.

The model assumption, equation (17), implies that boundary- 
layer velocity vectors in region (I) are collateral. Thus, one 
could imagine the existence of a two-dimensional boundary layer 
pointing in the direction of the wall shear stress and coincident 
with the real skewed boundary layer in region (I). Let the 
velocity at any point in this hypothetical two-dimensional 
boundary layer be u,. In region (I), 


(26) 


cos sin Y, 


Clauser [13] reports a method for determining $C_f$ for
two-dimensional boundary layers based on the so-called “law of the
wall.”¹¹ Clauser’s method uses only the velocity profile and the
kinematic viscosity, $\nu$, for determination of skin-friction
coefficient. The law of the wall having the constants Clauser uses
is written as

$$\frac{u}{(\tau_0/\rho)^{1/2}} = 5.6 \log_{10}\left[\frac{y\,(\tau_0/\rho)^{1/2}}{\nu}\right] + 4.9 \tag{27}$$

For two-dimensional, incompressible boundary layers on
smooth walls the relation (27) has been found to represent
adequately the velocity profiles in a region near the wall (outside the
laminar sublayer) even for cases where pressure gradients exist.

The assumption made here is that the hypothetical two-dimensional
boundary layer which is coincident with the skewed
boundary layer in region (I) is represented by equation (27).
Thus in region (I) one would expect that

$$\frac{u}{U}\,\frac{1}{\cos\gamma_w} = \frac{w}{U}\,\frac{1}{\sin\gamma_w} = \left(\frac{C_f}{2}\right)^{1/2}\left\{5.6 \log_{10}\left[\frac{yU}{\nu}\left(\frac{C_f}{2}\right)^{1/2}\right] + 4.9\right\} \tag{28}$$

An investigation of the data of references [3, 4, 5] indicates
that the region of applicability of equation (27) or (28) extends
only from the outer part of region (I) into the inner part of region
(II) of the triangular model. Even though one would expect the
relation (28) to be valid only in region (I), it is assumed that it
gives an adequate representation of the velocity profiles in the
lower parts of region (II).
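The way a skin-friction coefficient can be extracted from a measured point with the law of the wall can be sketched as a one-dimensional root search. This is a minimal illustration under stated assumptions, not the author's procedure (which used plotted charts): it assumes $C_f = \tau_0 / (\tfrac{1}{2}\rho U^2)$, so that the friction velocity is $U (C_f/2)^{1/2}$, and it assumes the velocity vectors are collateral so that $u/\cos\gamma_w$ equals the resultant speed. The function names and the bracketing interval are hypothetical.

```python
import math


def law_of_wall_rhs(y_U_over_nu, cf):
    """sqrt(cf/2) * (5.6 log10(y+) + 4.9), with y+ = (y U / nu) sqrt(cf/2)."""
    s = math.sqrt(cf / 2.0)               # friction velocity divided by U
    return s * (5.6 * math.log10(y_U_over_nu * s) + 4.9)


def cf_from_point(u_over_U, w_over_U, y_U_over_nu, lo=1e-4, hi=2e-2):
    """Solve the law of the wall for C_f by bisection from one measured point."""
    # Resultant speed; equals u / (U cos gamma_w) for collateral vectors.
    c_over_U = math.hypot(u_over_U, w_over_U)

    def residual(cf):
        return law_of_wall_rhs(y_U_over_nu, cf) - c_over_U

    if residual(lo) > 0.0 or residual(hi) < 0.0:
        raise ValueError("C_f is not bracketed by [lo, hi]")
    for _ in range(60):                   # halve the bracket 60 times
        mid = 0.5 * (lo + hi)
        if residual(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


# Round trip: build a point from a known C_f, then recover it.
cf_true, gamma_w, y_U_over_nu = 0.003, math.atan(0.1), 5000.0
c = law_of_wall_rhs(y_U_over_nu, cf_true)
cf_back = cf_from_point(c * math.cos(gamma_w), c * math.sin(gamma_w), y_U_over_nu)
print(f"recovered C_f = {cf_back:.6f}")
```

In practice several points from the logarithmic part of the profile would be used and the results compared, since a single point carries the full measurement scatter.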

The data of references [3, 4, 5] were plotted in the form
indicated by equation (28) in order to determine the value of the
skin-friction coefficient $C_f$. It must be remembered that the
assumptions used for the determination of $C_f$ by this method
need to be checked by direct measurement of wall shear stress.
It is the author’s opinion that the correlation expressed in
equation (28) is the correct one, at least for small crossflows.

The main-flow component $C_{fx}$ is related to $C_f$ by the
equation

$$C_{fx}\,(1 + e^2)^{1/2} = C_f \tag{29}$$

It was assumed, for small crossflows (e small), that $C_{fx}$ might
be predicted by one of the simple two-dimensional methods.
A formulation useful in the presence of pressure gradients was
deduced by Ludwieg and Tillmann [14], which is assumed to be
applicable when written in terms of quantities defined in the
main-flow direction only:

$$C_{fx} = 0.246\,[\exp(-1.561 H_x)]\,R_{\theta x}^{-0.268} \tag{30}$$

¹¹ Also see Coles [9].


For two-dimensional flows, equation (30) is assumed to give
discrepancies in $C_{fx}$ of less than 3 per cent for
$R_{\theta x} = 10^3$ to $4 \times 10^4$.
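The skin-friction law of equation (30) is simple to evaluate directly. The sketch below uses the standard Ludwieg-Tillmann constants (0.246, 1.561, and the exponent −0.268 on the momentum-thickness Reynolds number); the function name and the sample values of the shape parameter are illustrative assumptions.

```python
import math


def cf_ludwieg_tillmann(H_x, R_theta_x):
    """C_fx = 0.246 exp(-1.561 H_x) R_theta_x^(-0.268)."""
    return 0.246 * math.exp(-1.561 * H_x) * R_theta_x ** (-0.268)


# Illustrative shape parameters at a momentum-thickness Reynolds number of 1e4.
for H_x in (1.3, 1.6, 2.0, 2.4):
    print(f"H_x = {H_x:.1f}  C_fx = {cf_ludwieg_tillmann(H_x, 1.0e4):.5f}")
```

The exponential makes the predicted coefficient fall steadily as the shape parameter rises, but it never reaches zero for finite shape parameter.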

Fig. 13 shows a comparison between values of $C_{fx}$ obtained
from equation (30) using experimental values of $H_x$ and $R_{\theta x}$ and
values of $C_{fx}$ obtained from the law-of-the-wall method and
experimental values of e. It is seen that equation (30) gives values
of $C_{fx}$ somewhat higher than those from the law-of-the-wall
method in the region below $C_{fx} = 0.002$. It is known that the
Ludwieg-Tillmann formulation predicts values of skin-friction
coefficient that are too large in the region of low skin-friction
coefficient; $C_{fx}$ in equation (30) cannot go to zero except for
$H_x = \infty$. However, the discrepancies in regions of large $C_{fx}$ are
at present unaccounted for.

No effect of crossflow, i.e., parameter e, was noticed in the
values of $C_{fx}$ predicted by the assumption of equation (30).
This may be due to the scatter of the data which obscures any
trend due to the crossflow. The scatter may be accounted for by
the uncertainties in the various experimental quantities
(principally $H_x$) that were used in the computation of $C_{fx}$.


Formulation of Relations Between Integral Quantities, 
Model Parameters, and Skin-Friction Coefficient 


The triangular-model assumption may be used in the defini- 
tions of the integrated quantities in the momentum-integral equa- 
tions, but one cannot form definite relationships between the in- 
tegrated quantities and the model quantities until one makes 
some assumption about the shape of the u/U profile as a function 
of y. 

Fig. 13 Comparison of values of skin-friction coefficient determined
by “law-of-the-wall” method and by method of equation (30)

Due to the form of the model, one need only designate the shape
of the profile in either region (I) or region (II) in order to form the
desired relations. As pointed out earlier, observations of data
indicate that region (I) is very thin and extends just outside the
laminar sublayer. On the basis of this observation two assumptions
are made:


(a) Region (I) lies within the laminar sublayer.
(b) Velocity profile in the laminar sublayer is linear and may be
expressed by

$$c = \frac{y}{\nu}\left(\frac{\tau_0}{\rho}\right) \tag{31}$$

Of course, the linear-profile shape in the laminar sublayer as
expressed by equation (31) is valid for two-dimensional boundary
layers only in the limit as y approaches zero in cases where a
pressure gradient exists. However, due to the thinness of the
sublayer, deviations from equation (31) should be small.

Recognizing in region (I) that

$$c = u\,(1 + e^2)^{1/2} \quad \text{and} \quad \tau_0 = \tau_{0x}\,(1 + e^2)^{1/2}$$

and using the definition of $C_{fx}$, one sees that equation (31) can
be rewritten as

$$\frac{u}{U} = \frac{C_{fx} R_{\theta x}}{2}\left(\frac{y}{\theta_x}\right) \tag{32}$$

Referring to the sketch of the triangular model, Fig. 7, it is seen
that the value of u/U at the junction point between regions (I)
and (II) may be written as

$$\left(\frac{u}{U}\right)_p = \frac{1}{1 + e/A} \tag{33}$$


Using equations (32) and (33) one may evaluate y at the
junction point P as

$$\frac{y_p}{\theta_x} = \frac{2}{C_{fx} R_{\theta x}}\,\frac{1}{1 + e/A} \tag{34}$$
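The height of the junction point can be checked numerically by combining the linear sublayer profile with the junction value of u/U from the triangular model. The sketch is illustrative only: it assumes $C_{fx} = \tau_{0x} / (\tfrac{1}{2}\rho U^2)$ and $R_{\theta x} = U\theta_x/\nu$, and the function names and sample values are hypothetical.

```python
def sublayer_u_over_U(y_over_theta, cfx, r_theta):
    """Linear sublayer profile: u/U = (C_fx R_theta / 2) (y / theta_x)."""
    return 0.5 * cfx * r_theta * y_over_theta


def junction_height(cfx, r_theta, e, A):
    """y_p / theta_x at which the sublayer profile reaches u/U = 1 / (1 + e/A)."""
    return 2.0 / (cfx * r_theta * (1.0 + e / A))


# Illustrative values only (not taken from the data).
cfx, r_theta, e, A = 0.003, 5000.0, 0.2, 0.6
yp = junction_height(cfx, r_theta, e, A)
print(f"y_p / theta_x = {yp:.4f}")
print(f"u/U at junction = {sublayer_u_over_U(yp, cfx, r_theta):.4f}")
```

For values of this order the junction lies at a small fraction of the momentum thickness, which is in line with the observation that region (I) is very thin.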
The six integrated quantities in the momentum-integral equations
may be written out (using the model assumption) as follows:


+f 
Up 


Oz 


In equations (35), the integrals from 0 to $y_p/\theta_x$ are over region
(I) and the integrals from $y_p/\theta_x$ to $\infty$ are over region (II). If we
substitute for u/U by use of equation (32) in the integrals over
region (I) and evaluate the limit $y_p/\theta_x$ according to equation (34)
the integrals in region (I) can be evaluated. Since only three of
the six integrals in region (II) are independent we may solve for
the quantities in the cross-flow direction ($\delta_z$, $\theta_z$, $\theta_{xz}$, and
$\theta_{zx}$) as functions of the integral quantities in the main-flow
direction ($\delta_x$ and $\theta_x$), $C_{fx}$, $R_{\theta x}$, and the model parameters
(e and A). These functions are

2
1 (¢/A —1) |
3 (e/A + 1) 


ari) - 
e/A+1 


(39) 


Since each of the foregoing relationships involves six or seven 
quantities, it would be difficult to check them by correlation of 
data. However, one may reduce the number of quantities by 
further assumptions. 

In two-dimensional turbulent-boundary-layer work it has been 
found that the point where the linear, laminar-sublayer profile
intersects the log-linear, turbulent profile defines a constant,¹³


u 
This constant is found to equal 10 or 12 for two-dimensional 
boundary layers. 


One may write a nondimensional number $\eta_p$ similar to the
foregoing but based on $y_p$


(=) 


Using equation (40) together with equation (34) and the 
definitions of C,, and Rg, one obtains the relationship 


/ 
A 


(40) 


(41) 


In Fig. 14, equation (41) is plotted for constant values of $\eta_p$.
Also shown are experimental data from references [3, 4, 5]. It is
seen that most of the data fall between values of $\eta_p$ = 12 and 16.
To a first approximation this suggests that $\eta_p$ is a constant as is
the number $\eta_l$ of two-dimensional turbulent boundary layers.

If we assume that $\eta_p = 10\sqrt{2} \approx 14$, equation (41) may be
written in the form


(42) 
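As a reading aid, the step from the junction height to a relation among the junction number, the skin-friction coefficient, and the ratio e/A can be sketched as follows. The sketch is not taken from the paper. It assumes that the nondimensional number at the junction (written $\eta_p$ here, a notation chosen for this sketch) is formed with the resultant wall shear stress $\tau_0 = \tau_{0x}(1 + e^2)^{1/2}$, that $C_{fx} = \tau_{0x}/(\tfrac{1}{2}\rho U^2)$ and $R_{\theta x} = U\theta_x/\nu$, and that $y_p/\theta_x = 2 / [C_{fx} R_{\theta x} (1 + e/A)]$. The arrangement of equations (41) and (42) in the original may differ.

```latex
\begin{aligned}
\eta_p &= \frac{y_p}{\nu}\left(\frac{\tau_0}{\rho}\right)^{1/2}
        = \frac{y_p}{\theta_x}\,R_{\theta x}
          \left(\frac{C_{fx}}{2}\right)^{1/2}(1+e^2)^{1/4} \\
       &= \frac{2}{C_{fx}R_{\theta x}}\,\frac{1}{1+e/A}\;
          R_{\theta x}\left(\frac{C_{fx}}{2}\right)^{1/2}(1+e^2)^{1/4}
        = \left(\frac{2}{C_{fx}}\right)^{1/2}\frac{(1+e^2)^{1/4}}{1+e/A}
\end{aligned}
```

Under these assumptions the Reynolds number cancels, so for a fixed value of $\eta_p$ the ratio e/A depends only on $C_{fx}$ and e.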


Using equation (42) and the simplified shear-stress relation, 
equation (30), together with equations (37), (38), and (39), one is 
able to produce a set of charts of the cross-flow integral quantities 
6,,/(A8@,), 0,/(A%0,), and 6,/(AQ@,) as functions of H, for constant 
values of Rg, and e. 

Such charts were produced, Figs. 15, 16, and 17, which span a 
range of H, from 1.0 to 2.6 for values of Reg = 10° and Rez = 10%. 
Two lines of constant e are shown for each Rg,. These values of 
Re, and e are quite adequate to span the range of the data availa- 
ble. It is observed that for values of Rg, > 10‘, the right-hand 
terms in equations (37), (38), and (39) may be ignored thereby 
allowing the parameters 6,,/(A0,), 0,/(A°0,), and 6,/(A8@,) to 
be expressed as simple linear functions of H,. The straight 
dashed lines in Figs. 15, 16, and 17 represent these linear func- 
tions. 
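The linear dependence on the shape parameter noted above follows directly from the region (II) relation w = A(U − u) once the contribution of region (I) is dropped. The sketch below is not from the paper. It assumes unsigned integral definitions, with $\delta_x = \int_0^\infty (1 - u/U)\,dy$, $\theta_x = \int_0^\infty (u/U)(1 - u/U)\,dy$, and $H_x = \delta_x/\theta_x$. Because the signs and symbols in the paper's own definitions may differ, the crossflow integrals are written out rather than named.

```latex
\begin{aligned}
\int_0^\infty \frac{w}{U}\,dy
  &= A\int_0^\infty\left(1-\frac{u}{U}\right)dy
   = A\,\delta_x = A\,\theta_x H_x \\
\int_0^\infty \left(1-\frac{u}{U}\right)\frac{w}{U}\,dy
  &= A\int_0^\infty\left(1-\frac{u}{U}\right)^2 dy
   = A\,(\delta_x-\theta_x) = A\,\theta_x\,(H_x-1) \\
\int_0^\infty \frac{u}{U}\,\frac{w}{U}\,dy
  &= A\int_0^\infty \frac{u}{U}\left(1-\frac{u}{U}\right)dy
   = A\,\theta_x \\
\int_0^\infty \left(\frac{w}{U}\right)^2 dy
  &= A^2\int_0^\infty\left(1-\frac{u}{U}\right)^2 dy
   = A^2\,\theta_x\,(H_x-1)
\end{aligned}
```

Each integral, divided by $A\theta_x$ or $A^2\theta_x$ as appropriate, is therefore either a constant or a linear function of $H_x$ in this limit.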

The data of references [3, 4, 5] are shown on these charts. The
scatter of the data (especially evident in Fig. 17) is principally
due to inaccuracies in the measured integrated quantities of the
crossflow. Regardless of the scatter, it is evident that
most of the data lie between the lines of constant $R_{\theta x}$ where one
would expect them to lie if our model assumptions and the
simplifying assumptions of this section are valid. As a matter of
fact, considering the number of simplifying assumptions used, the
degree of correlation shown in Figs. 15, 16, and 17 seems quite
good.

¹³ $y_l$ is the value of y at the intersection of the two profiles.

Fig. 15 Comparison of equation (37) with experiment, assuming
equations (30) and (42) are valid

Fig. 16 Comparison of equation (38) with experiment, assuming
equations (30) and (42) are valid

Fig. 17 Comparison of equation (39) with experiment, assuming
equations (30) and (42) are valid


Discussion of Results 


As pointed out earlier, five auxiliary equations are required to 
carry out a solution of the problem by means of the momentum- 
integral equations. The purpose of this discussion is to bring 
forward those results that may be used as auxiliary relations and 
to point to areas where further information is required. 

The triangular model and the assumption of a linear-velocity 
profile in region (I) led to the results expressed in equations (36), 
(37), (38), and (39). They express the relationship of the inte- 
grated parameters of the crossflow to the other parameters in- 
volved in the problem. Three of these four equations are inde- 
pendent and therefore they supply three of the five auxiliary 
equations required for a calculation scheme. The modified
shear-stress equation of Ludwieg and Tillmann, equation (30),
supplies a fourth relation. However, the information necessary
to give the fifth and final auxiliary equation is not apparent from
the work developed in this paper.

The missing information concerns the way in which the main- 
flow component of the boundary-layer-velocity profile (u/U versus 
y) shape varies with respect to all of the parameters and quantities 
involved in the problem. The desired relationship must be quite 
similar in nature to one of the many empirical “shape-parameter”
equations developed for the purpose of computing two-dimen- 
sional turbulent-boundary-layer problems because, when the 
three-dimensional problem is reduced to the two-dimensional 
problem, the three independent equations developed here for the 
cross-flow parameters reduce to zero, but a single relationship for 
the shape parameter is still required. 

It is the author’s opinion that the available data are not
sufficient¹⁴ to establish the fifth and final relationship even if such a
procedure is possible. It is hoped that the work presented here 
will stimulate further work in this field and in particular will cause 
new experiments to be planned and conducted. 


Conclusions 

In this paper two objectives have been achieved. First, a con- 
ceptual model, the triangular model, of the secondary-flow type 
of three-dimensional turbulent boundary layer has been estab- 
lished which allows correlation of all the available data. Also, a 
study of the data with the aid of the triangular model permitted 
the establishment of four of the five auxiliary relations re- 
quired to carry out a boundary-layer-problem solution using the 
momentum-integral equations. 

In particular, one may cite the following conclusions with re- 
spect to the triangular model: 


1 The layer may be split into two regions: an inner region (I)
adjacent to the wall, and an outer region (II) adjacent to the main
flow.

2 In region (I) the boundary-layer-velocity vectors are
collateral and their components in the streamline co-ordinate system
may be expressed by the simple relation


w= eu 


Furthermore, it may be assumed that the velocity profile in the 
region (I) is approximately linear and may be expressed by the 
relation 


3 In region (II) the velocity vectors are not collateral but 
their components in the streamline co-ordinate system may be 
expressed by the simple relation 


w = A(U — u) 


It is also demonstrated that the parameter A is related to the
turning angle of the main flow. In the case of circular-arc-shaped,
main-flow streamlines

$$A = -2\alpha$$

as is to be expected from considerations of simple secondary-flow
theory.

On the basis of the foregoing conclusions, four equations which
express the relationship of the cross-flow integral quantities
$\delta_z$, $\theta_z$, $\theta_{xz}$, $\theta_{zx}$ to the main-flow integral quantities
$\delta_x$ and $\theta_x$, the model parameters e and A, and the quantities
$C_{fx}$ and $R_{\theta x}$ have been established. Three of these four
equations are independent and may be used with the
momentum-integral equations for calculation purposes. It is shown
that the Ludwieg-Tillmann skin-friction law also may be used for
calculation purposes. Finally it is pointed out that the fifth
equation required for a calculation has not been, and could not be,
established from the presently available data.

¹⁴ See discussion relating to the model of Coles [9], the
establishment of which would also require much more data than are
now available.


Acknowledgments 


The research forming the basis of this paper was conducted in 
the Massachusetts Institute of Technology Gas Turbine Labora- 
tory under the sponsorship of General Electric Company, West- 
inghouse Electric Corporation, Curtiss-Wright Corporation, 
Allison Division of the General Motors Corporation, and the 
Office of Naval Research, USN. The author is indebted to Prof.
Edward S. Taylor for his encouragement and guidance.


References 


1 E. S. Taylor, “The Skewed Boundary Layer,” Trans. ASME,
Series D, Journal of Basic Engineering, vol. 81, 1959, pp. 297-

2 R. J. Timman, “The Theory of Three-Dimensional Laminar
Boundary Layers,” from proceedings of a symposium held at the
National Physical Laboratory entitled “Boundary Layer Effects in
Aerodynamics,” Her Majesty's Stationery Office, London, England,
1955.

3 E. Gruschwitz, “Turbulente Reibungsschichten mit
Sekundärströmung,” Ingenieur-Archiv, vol. 4, 1935, pp. 355-365.

4 A. M. Kuethe, P. G. McKee, and W. H. Curry, “Measurements
in the Boundary Layer of a Yawed Wing,” NACA TN 1946,
September, 1949.

5 J. P. Johnston, “Three-Dimensional Turbulent Boundary
Layer,” ScD Thesis, Massachusetts Institute of Technology, 1957;
also available as M.I.T. Gas Turbine Laboratory Report No. 39,
May, 1957.

6 R. W. Moore, Jr., and D. L. Richardson, “Skewed Boundary-
Layer Flow Near the End Walls of a Compressor Cascade,” Trans.
ASME, vol. 79, 1957, pp. 1789-1800. The study referred to in this
paper is from their paper, “A Study of the End Wall Boundary Layer
in an Axial Compressor Blade Row,” Gas Turbine Laboratory Report
No. 33, M.I.T., October, 1955.

7 L. Prandtl, “On Boundary Layers in Three-Dimensional
Flow,” Reports and Transactions, No. 64, British M.A.P., May,
1946.

8 A. Mager, “Generalization of Boundary Layer Momentum
Integral Equations to Three Dimensional Flows Including Those of
Rotating Systems,” NACA Report 1067, 1952.

9 D. Coles, “The Law of the Wake in the Turbulent Boundary
Layer,” Journal of Fluid Mechanics, vol. 1, part 2, July, 1956, pp. 191-
226.

10 F. H. Clauser, “The Turbulent Boundary Layer,” Advances in
Applied Mechanics, vol. 4, Academic Press, Inc., New York, N. Y.,
1957, pp. 2-51.

11 A. E. von Doenhoff and N. Tetervin, “Determination of
General Relations for the Behavior of Turbulent Boundary Layers,”
NACA Report No. 772, 1943.

12 H. B. Squire and K. G. Winter, “The Secondary Flow in a
Cascade of Airfoils in a Non-Uniform Stream,” Journal of the
Aeronautical Sciences, vol. 18, 1951, pp. 271-277.

13 F. H. Clauser, “Turbulent Boundary Layers in Adverse
Pressure Gradients,” Journal of the Aeronautical Sciences, vol. 21, 1954,
pp. 91-108.

14 H. Ludwieg and W. Tillmann, “Investigation of the Wall
Shearing Stress in Turbulent Boundary Layers,” NACA TM 1285,
May, 1952.


APPENDIX 


The boundary-layer equations along a main-flow streamline are 
given in equations (5), (6), (7). If one drops the terms relating 
to shear stress from equations (5) and (6) and utilizes equations 
(1), (2), (4), one can express equation (5) as 


u v Oy +u 


da 


and equation (6) as 
First consider the case treated by Mager (discussion of [6]).
Mager assumes that $\partial U/\partial x = 0$ and implies that
$\partial\alpha/\partial z = 0$. If one makes the same assumption, one
simplifies equations (43) and (44) considerably. Furthermore, if one
assumes, following Mager's reasoning, that $w$ is small compared to $u$
(of the same order of magnitude as $v$) then equations (43) and (44)
reduce to


(45) 
ox oy 
and 
ow ow 


oa 
us (u ) (46) 


Eliminating v from the foregoing one obtains 
— = — — UX) — (47) 
If one assumes a solution in the form

$$w = U\,K(\alpha)\,f\!\left(\frac{u}{U}\right) \tag{48}$$

and introduces it into equation (47) one finds that $K = \alpha$ and
$f = (u/U) - (U/u)$ so that

$$w = \alpha U\left(\frac{u}{U} - \frac{U}{u}\right) \tag{49}$$

Since this solution is valid only where $u \approx U$ (near the main
stream) equation (49) reduces to the simple form

$$\frac{w}{U} = -2\alpha\left(1 - \frac{u}{U}\right) \tag{50}$$

As pointed out in the text, equation (50) satisfies the model
assumption in region (II), equation (18), where $A = -2\alpha$.
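The step from equation (49) to equation (50) is a first-order expansion about u/U = 1. As a quick numerical check, the sketch below compares the two expressions for w/U, taking K = α and f = (u/U) − (U/u) as stated above. The function names and the sample turning angle are illustrative assumptions, not values from the paper.

```python
def cross_flow_full(u_over_U: float, alpha: float) -> float:
    """w/U from equation (49): alpha * (u/U - U/u)."""
    return alpha * (u_over_U - 1.0 / u_over_U)


def cross_flow_linear(u_over_U: float, alpha: float) -> float:
    """w/U from equation (50): -2 * alpha * (1 - u/U)."""
    return -2.0 * alpha * (1.0 - u_over_U)


if __name__ == "__main__":
    alpha = 0.1  # illustrative turning angle in radians (assumed value)
    for ratio in (0.99, 0.95, 0.80, 0.50):
        full = cross_flow_full(ratio, alpha)
        linear = cross_flow_linear(ratio, alpha)
        print(f"u/U = {ratio:.2f}  eq. (49) = {full:+.5f}  eq. (50) = {linear:+.5f}")
```

The two expressions agree closely near the main stream and drift apart as u/U falls, which is consistent with restricting equation (50) to the outer part of the layer.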


One can now extend Mager's analysis for cases where
$\partial\alpha/\partial z$ and $\partial U/\partial x$ are not equal to
zero if one assumes that $w$ is related to $u$ in the form of equation
(18). First substitute for $w$ in equations (43) and (44) with
$w = A(U - u)$ and then eliminate $v$ from the resulting equations. As a
result one obtains


dA oA oa 
u + A — AL u)] 


ou 


It is recalled that the assumption about the form of w is only 
valid where u is approximately equal to U. Therefore it is not 
unreasonable to assume that one may drop terms multiplied by 


(U — u) and let u = U. Doing so one finds that equation (51) 
takes the simple form 


oA da ou oa 
U -~— AU — -A = —2U — 52 
or oz Oz or (62) 


DISCUSSION 


Donald Coles¹⁴


Some of the author’s experimental data are analyzed in Fig. 18 


14 Associate Professor, California Institute of Technology, Pasadena, Calif.
according to the flow model proposed in my paper [9]. The resolution
of the mean-velocity field is in terms of (1) a logarithmic
“wall” component in the direction of the surface shearing stress,
shown by the open points, and (2) the residual flow, shown by the
solid points. In my paper it is suggested that the second component
ought to have a characteristic wake-like form. It is also
suggested, as a general hypothesis for yawed flows, that this
“wake” component might turn out to be in the direction of the
pressure gradient.
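A minimal two-dimensional sketch of the two-component description follows. It assumes the commonly quoted form of the law of the wake, u/u_τ = (1/κ) ln(y u_τ/ν) + C + (Π/κ) w(y/δ), with κ = 0.40 and C = 5.1, and it uses the analytic approximation w ≈ 2 sin²(πy/2δ) in place of the tabulated wake function of [9]. The function names and sample values are illustrative assumptions, and the sketch does not reproduce the yawed-flow resolution used for Fig. 18.

```python
import math

KAPPA = 0.40   # Karman constant (assumed value)
C_WALL = 5.1   # additive constant of the logarithmic law (assumed value)


def wall_component(y_plus: float) -> float:
    """Logarithmic wall component u/u_tau as a function of y*u_tau/nu."""
    return math.log(y_plus) / KAPPA + C_WALL


def wake_function(eta: float) -> float:
    """Approximate wake function w(y/delta), with w(0) = 0 and w(1) = 2."""
    return 2.0 * math.sin(0.5 * math.pi * eta) ** 2


def mean_velocity(y_plus: float, delta_plus: float, wake_strength: float) -> float:
    """u/u_tau as the sum of the wall and wake components.

    wake_strength is the profile parameter Pi; delta_plus is delta*u_tau/nu.
    """
    eta = y_plus / delta_plus
    return wall_component(y_plus) + (wake_strength / KAPPA) * wake_function(eta)


if __name__ == "__main__":
    # Illustrative profile: delta+ = 1000, Pi = 0.55 (assumed values).
    for y_plus in (50.0, 200.0, 500.0, 1000.0):
        print(f"y+ = {y_plus:6.0f}  u+ = {mean_velocity(y_plus, 1000.0, 0.55):.2f}")
```

Subtracting `wall_component` from a measured profile leaves the residual that is compared with the S-shaped wake curve.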

The analysis here has been made just as outlined on pp. 222-224
of [9], beginning with the profile component in the direction of the
free stream, and the treatment of the data is quite unambiguous.
The long dashed lines in Fig. 18 show the approximate position of 
lines of constant free-stream velocity, and hence of constant pres- 
sure. The data therefore support the supposition that the second 
profile component should be in the direction of the pressure 
gradient. On the other hand, the form of this second component 
is very far from fitting the curve which is proposed as a universal 
wake function in my paper [9] and is shown here as an S-shaped 
line near the solid points for each profile. 

Since the author’s measurements undoubtedly provide an ac- 
curate representation of the state of his flow, it must be con- 
cluded that my two-component model as originally formulated 
is not valid in this case. However, because the attempted fit in my 
paper [9] to the data of Kuethe, McKee, and Curry [4] was more
successful, I want to raise two questions concerning Dr. John- 
ston’s experiments. First, his observations of surface dye traces 
apparently indicate a mean flow direction very near the surface 
which is at some variance with the probe data. This discrepancy, 
although it suggests a direction for the wall component which 
makes the fit to my flow model worse rather than better, ought 
at least to be accounted for. Second, the poor fit for the wake 


component is also found on the flow center line (e.g., Station D3 in
Fig. 18), where it cannot easily be explained by effects of yaw.
The observed change in shape for the wake component is very 
like that produced by a high free-stream turbulence level, and 
it therefore seems proper to ask whether or not the present ex- 
periments might have been made in the presence of increased 
large-eddy mixing as a result of abnormally large fluctuations 
occurring outside the boundary layer proper and associated with 
strong secondary flow in a relatively confined channel. 


Arthur G. Hansen¹⁵


This paper is most interesting and informative for several 
reasons. First, the author has presented a novel and useful ap- 
plication of the momentum-integral technique to three-dimen- 
sional turbulent boundary-layer flows. Secondly, his critical 
evaluation of current research in this area is of value in its own 
right. Finally, his review of existing experimental information 
on secondary flows has sharply pointed up the need for additional 
research. 

Turning to specific technical details, it is quite obvious that
the author should include the metric components $h_x$ and $h_z$ in
the various equations given in the paper. It is a simple matter to
make this correction. Fortunately, the analysis is not significantly
affected by the oversight.

An important point which is difficult to assess from the infor- 
mation given concerns the general applicability of the triangular 
velocity diagram. It must be noted that most of the supporting 
experimental data were apparently taken either in regions where 
the boundary layer had become quite thick, or in regions which 
were fairly far removed from the origination of the secondary 
flow. In order to gain additional insight into this problem of ap- 
plicability, the discusser calculated some polar velocity diagrams 
in a written discussion of the paper, “The Skewed Boundary
Layer,” by E. S. Taylor [1]. The diagrams were based on several
exact solutions of three-dimensional, laminar, boundary-layer flows
over a flat plate. Calculations were made at various distances 
from the plate leading edge. The results indicated that in the 
leading edge region the diagrams curved smoothly from the wall 
position to the mainstream position and were not triangular. 
Only at some distance from the leading edge was a triangular 
plot approached. It is difficult to generalize from these results 
obtained specifically for a particular class of laminar flows. Nev- 
ertheless, the indicated trends suggest that an experimental 
study of secondary flows near leading edge regions might be most 
helpful in determining criteria for the validity of the triangular 
model. 

It is hardly necessary to point out that the author’s methods 
(and for that matter, the general theory) will tend to break down 
in corner regions of blade rows and channels where wall effects 
become predominant. One might wish that significant theoretical 
research on this aspect of secondary flows existed to supplement 
work such as that presented here. Unfortunately, only the most 
preliminary steps have been taken in this direction by research 
workers in the field. 


Artur Mager¹⁶


The reviewer would like to congratulate the author on advancing
some original ideas regarding the treatment of three-dimensional
turbulent velocity profiles. Although the complete computational
method for such boundary layers is still lacking, the
author's mode of description of the velocity profile appears to be
a step in the right direction.


15 Head, Nucleonics Section, Cornell Aeronautical Laboratory, Inc.,
of Cornell University, Buffalo, N. Y.

16 Technical Consultant, ASTRO-Marquardt Corporation, Van
Nuys, Calif.
Aside from these general and complimentary remarks, the reviewer
notes that there appear to be certain objectionable statements
in the paper, which though inconsequential to the main
ideas, still should be pointed out. Most of these objections
originate in the section entitled, “The Co-Ordinate System and
Equations” and therefore, since the equations themselves are in
question, may possibly also have a bearing on some of the results
obtained by the author.

Thus, for example, the author states that by choosing unit
lengths along the reference lines he is able to set the length
stretching factors $h_x$ and $h_z$ to unity. While this statement is
true on the reference lines, it is not true everywhere in the field.
This is known from differential geometry, which says that
such a choice can be made only for orthogonal geodesics (here, on a
flat surface, these would have to be straight lines) and is also
apparent from the author's Fig. 1. In this figure consider, for
example, travel along the $z$ co-ordinate through two $z$ units (e.g.,
from the reference streamline to the lowest streamline illustrated).
Clearly, the actual distance traveled (e.g., $s$) along the orthogonal
trajectory at $x = 2$ is much larger than that distance along the
reference orthogonal trajectory (e.g., at $x = 0$). Consequently
(since $dx$ is zero and $ds = h_z\,dz$) $h_z$ is not equal to unity at
$x = 2$.

The fact that $h_x$ and $h_z$ are not unity, except on the reference
streamline and reference o.t. respectively, leads to further
discrepancies. Thus, for example, $\partial\alpha/\partial x$ and
$\partial\alpha/\partial z$ do not represent the curvatures of the
streamlines and orthogonal trajectories away from the reference lines.
This can be seen from the following argument: Since the surface chosen
is flat, the total curvature (i.e., the Gaussian curvature) must be
zero everywhere. This means that:


1 


where $K_x$ and $K_z$ are the curvatures (geodesic) of the streamlines
and orthogonal trajectories respectively. Equation (53) may be
satisfied in general by a function (say $\alpha$) defined so that:

$$h_x K_x = \frac{\partial\alpha}{\partial x}, \qquad h_z K_z = \frac{\partial\alpha}{\partial z} \tag{54}$$

and thus the derivatives of $\alpha$ do not represent the curvatures
except, again, on the reference streamline and reference o.t. where
$h_x$ and $h_z$ are locally (and only respectively) unity. One thus sees
that $\alpha$ is indeed the reference direction.

Equation (54) and the definitions of the curvatures permit one to
establish the appropriate relations between $\alpha$, $h_x$, and $h_z$:


1 dh, da 1 


; = 
h, Oz or 


AK, = 
h, Or 


showing again that $h_x$ and $h_z$ cannot be unity everywhere if the
derivatives of $\alpha$ do not vanish.

The fact that $h_x$ and $h_z$ are in general different from unity
leads the reviewer to question the form of the author's subsequent
equations (e.g., 1, 2, 3, 4, 5, and 6). Although, as shown by
Struminskii, it is possible to eliminate by a certain transformation the
$h_x$ and $h_z$ factors from most of the terms, the form thus derived
appears to lead to certain additional terms not occurring in
the author's equations. On the other hand, it is also possible that the
equations as given by the author are valid in the immediate
neighborhood of the reference lines. The reviewer had no time
to pursue this matter further and would appreciate some explanation
from the author (Reference [2] is not easily available). In
any case, it is obvious that there is a discrepancy between the author's
Equations (2) and (6) and, in particular, the reviewer believes
that in Equation (2) $\partial\alpha/\partial z$ should be
$\partial\alpha/\partial x$. This last statement, of course, means that
also Equations (44) and possibly (51), (52), and (22) do not appear to
be correct.
The reviewer would like to stress again that while in his opinion,
the objections pointed out are serious and need be clarified by the
author, they do not detract from the original idea suggested by
Dr. Johnston. This idea which apparently permits description
of the three-dimensional turbulent velocity profile fairly accurately
is important enough in itself to make this paper a
valuable addition to the three-dimensional boundary layer
literature.


Yasutoshi Senoo¹⁷


The author is to be complimented for the careful review of the
three-dimensional turbulent boundary layer as well as for the
remarkable proposal of the triangular model. The writer and his
associate have worked on the three-dimensional boundary layer in a
vaneless diffuser and experimentally confirmed the validity of the
triangular model. The boundary-layer flow in the diffuser was
theoretically predicted using a momentum method and the results
showed a remarkable agreement with experimental data.
In that case, because of axi-symmetry, the equations were much
simpler than the general case presented here. However, the
writer believes that a similarly good agreement will be achieved
for general cases between experimental data and the theoretical
prediction by the present method provided a reasonable main-flow
velocity profile is assumed.


E. S. Taylor¹⁸


The difficulties of Clauser [13] in producing a two-dimensional
(collateral) boundary layer experimentally are evidence that in an
adverse pressure gradient a collateral boundary layer is the exception
rather than the rule. Certainly in turbo-machinery it seems
evident that the conditions for establishing a collateral boundary
layer are rarely present and it can be expected that the boundary
layer is, in general, skewed.

The author's contribution to the understanding of the skewed
boundary layer is therefore especially welcome. In view of the
difficulties of measuring, describing, and analysing the collateral
turbulent boundary layer, Dr. Johnston's courage in tackling this
still more difficult problem and his success in coming up with a
relatively simple approach are to be applauded.


Author's Closure 


The author wishes to thank Professor Coles, Dr. Hansen, Dr. 
Mager, Professor Senoo, and Professor Taylor for their comments 
on the present paper. Herein, we shall attempt to answer, in 
order, the questions raised. 

Professor Coles has analyzed some of the author's original data
[5] by the assumptions of his flow model [9]. He comes to the
conclusion that his model does not fit the author's data in the
wake component, and he raises two questions concerning the
experiments.

The first concerns the variance between the angle of flow near 
the wall, as indicated by probe data, and that which he measures 
from the photograph of the surface dye traces shown in Fig. 15 of 
reference [5]. The author's measurements of angles from the dye 
traces on the original data sheet, which was much larger in scale 
and not distorted as Fig. 15, ref. [5] may be, showed agreement 
within the accuracy of measurement between the probe data and 
the dye trace data. 


17 Assistant Professor, Department of Mechanical Engineering,
Massachusetts Institute of Technology, Cambridge, Mass. 

18 Professor of Aircraft Engines, Gas Turbine Laboratory, Massa- 
chusetts Institute of Technology, Cambridge, Mass. Mem. ASME. 
Secondly, Professor Coles feels that the differences between the
predicted and experimental shape of the wake component profiles
in the author's experiments may be due to abnormal fluctuations
in the flow outside the boundary layer. The data which he
analyzed, in particular the D3 and D4 profiles, are very near the
line of separation. Small, rapid fluctuations in the position of the
separation line and in static pressure in the vicinity of the
separation line were observed. Thus the degree of free-stream fluctuation
may indeed be larger than normal in the locale of the analyzed
data.

Finally, with respect to Professor Coles’ remarks, the author 
did not intend to imply that his model of the yawed three-dimen- 
sional boundary layer is not valid. In our opinion, one is not able 
to give it a fair test with the data now available. It certainly 
behooves other workers in this field to check its validity against 
their data. 

Dr. Hansen points out the absence of the length stretching factors
$h_x$ and $h_z$ in the equations of the paper. This item is taken
up in the discussion of Dr. Mager's comments. Dr. Hansen also
points out areas where the triangular model may be of questionable
validity. We concur with him in his comments. In
particular, it was pointed out in the paper, perhaps not
strongly enough, that the results will probably be valid only in
regions where a well developed, collateral, turbulent boundary
layer enters a region of turning. This restriction would rule out
the application of the triangular model in leading edge regions.
The author also pointed out in a written discussion of reference
[1] some evidence for doubting the validity of the triangular
model in the case of laminar layers.

Dr. Mager questions the validity of equations (1) through (6)
in the paper. The basis of his criticism is the fact that the line
element stretching factors $h_x$ and $h_z$ cannot be equal to unity
except at the origin of co-ordinates or in cases where derivatives of
$\alpha$ vanish (rectangular co-ordinates). Dr. Mager is indeed correct
in his conclusions. Upon restudy of the derivation of the equations,
the author found that of the first ten equations only
equations (1) and (7) were correct at all co-ordinate positions.
The remainder are correct only at the origin of co-ordinates where
$h_x$ and $h_z$ may arbitrarily be set equal to unity. In order to make
equations (2) through (6) and (8), (9), (10) correct at any
co-ordinate position, the factor $1/h_x$ should be multiplied into all
terms containing derivatives with respect to $x$, and $1/h_z$ should be
multiplied into all terms containing derivatives with respect to $z$.
For example, equation (5) should read as:


u ou + ou + w Ou uw Oa@ Oa 
h, Ox oy h, Oz h, Ox h, 
1 OF 
(56) 
ph, Ox p oy 


to be correct at all co-ordinate positions. Similar corrections 
should be made in equations (22), (23), (43), (44), (45), (46), (51), 
and (52). 

In addition to this matter, Dr. Mager points out a discrepancy
between equations (2) and (6). This error in the preprint version
of the paper should not appear in this final edition. A recheck by
the author of equations (44), (51), (52), and (22) indicated that
they are correct except for the insertion of the $h_x$ and $h_z$
factors as pointed out in the foregoing.

It is most interesting to note in Professor Senoo’s comments 
that additional experimental evidence is available from vaneless 
diffuser studies, to confirm the triangular model proposed in the 
paper. It is hoped that the results of this work will be published 
in the near future. 
Addendum to
Elastic and Damping Properties of Cylindrical Journal Bearings¹


In extending the work presented in this paper, I uncovered a 
few points that needed revision. I felt it my duty to bring these 
to the attention of the reader, in order that he may not introduce 
errors by using the unrevised data. This is the main reason for 
this addendum. Furthermore, by extending the text just a little 
it can be shown that the results presented there have more general 
applicability than the paper may imply. 


Revisions 


1 The heading of the second column of Table 1 should read
$V_R'/C'N' = V_T'/C'N'$.

2 The title under Fig. 2 should read “Dimensionless components
of force as a function of dimensionless velocity.” The abscissa should
be $V_R'/C'N' = V_T'/C'N'$.

3 The title under Fig. 6 should read “Journal loci as a function of
eccentricity ratio for equal radial and tangential velocity.”


In the text (e.g., page 103 last and second to last paragraphs) 
when reference is made to Table 1 and Figs. 5 and 6, it should 
be remembered that these calculations were performed for equal 
radial and tangential squeeze film velocity. 


Extension 


It can be shown that solutions presented for radial and tangential
squeeze film velocity can be transformed to those containing
purely radial squeeze film velocity. The transformation will be
derived below. Equation (2) in the above paper can be written in
the following form:


Op’ 
(1 + n cos 6’)8 + R" (1 + n cos 0’) 
06’ ob’ Oz’ 


= Op n — }sin 


V,’ 
+4 cont’ | (la) 


where the prime (′) denotes dimensional quantities.

The integration of equation (1a) yields the pressure distribution 
in the bearing corresponding to the condition at which the inte- 
gration is performed. By further integration of the pressure 
within the noncavitating region one obtains the net force which 
the fluid film exerts on the journal and, thus, one may evaluate 
the components of force in two mutually perpendicular directions. 
(These can be either horizontal and vertical or radial and tan- 
gential forces. ) 

For dimensional analysis considerations, these components can 
be expressed in the following form: 


\'w'F (». 


( 

Cc’ 
where $\lambda'\omega'$ has dimensions of force, $V_R'$ and $V_T'$ are
the radial and tangential components, respectively, of the journal
velocity, and $F_R$ and $F_T$ are dimensionless forces dependent on the
dimensionless variables shown.


1 By B. Sternlicht, published in the June, 1959, issue of Trans.
ASME, Series D, Journal of Basic Engineering, pp. 101-108.
The right-hand member of equation (1a) is linear in $U'$, $V_T'$,
$V_R'$. It would appear from this that Reynolds equation for
incompressible fluids can be integrated separately first for a
stationary journal center ($U' \neq 0$, $V_T' = V_R' = 0$), then for a
radial velocity of the journal center ($U' = V_T' = 0$, $V_R' \neq 0$)
and, finally, for a tangential velocity of the journal center
($U' = V_R' = 0$, $V_T' \neq 0$), and hence, by superposition, the
pressure (and thus the forces $F_R$ and $F_T$) obtained for the joint
effects of $U'$, $V_R'$, and $V_T'$. This condition, however, is
vitiated because of the region of cavitation, which changes in a manner
depending on all three velocities.

Nevertheless, examination of the right-hand side of equation
(1a) shows that at least $U'$ and $V_T'$ enter in the form
$\{\eta U' - 2R'V_T'\}\sin\theta'$. Thus, solutions for $V_T' = 0$ can
be transformed to cases where $V_T' \neq 0$; this transformation is
shown below and two examples are given.
Suppose that equation (1a) is integrated for a given $L'/D'$ and
the forces $F_R'$ and $F_T'$ evaluated for $V_T' = 0$, yielding

$$F_R' = -\lambda'\omega' F_R(\eta, n), \qquad F_T' = \lambda'\omega' F_T(\eta, n) \tag{3a}$$

where $n = V_R'/C'N'$. Then, for nonzero $V_T'$, the force may be
derived, without further integration, by replacing $\omega'$ in
equation (3a) by

$$\omega'\left(1 - \frac{2V_T'}{\pi\eta C'N'}\right)$$

At the same time the dimensionless parameter $n$ must be divided by
$\left(1 - \dfrac{2V_T'}{\pi\eta C'N'}\right)$.


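Hence for $V_T' \neq 0$, the fluid film forces become:

$$F_R' = -\lambda'\omega'\left(1 - \frac{2V_T'}{\pi\eta C'N'}\right)F_R(\eta, \bar{n}), \qquad F_T' = \lambda'\omega'\left(1 - \frac{2V_T'}{\pi\eta C'N'}\right)F_T(\eta, \bar{n}) \tag{4a}$$

where

$$\bar{n} = \frac{V_R'}{C'N'} \bigg/ \left(1 - \frac{2V_T'}{\pi\eta C'N'}\right)$$


Example 1


From Table 1,

Given

L'/D'   η     V_R'/C'N'   V_T'/C'N'   F
1       0.5   -2          -2          44.7    150.8

Transform the above to the case where $V_T' = 0$.

$$1 - \frac{2V_T'}{\pi\eta C'N'} = 1 + \frac{2 \times 2}{\pi\eta} = 3.54$$

$$F = 44.7/3.54 = 12.6$$

and

$$\frac{V_R'}{C'N'} = \frac{-2}{3.54} = -0.564$$

Therefore,

L'/D'   η     V_R'/C'N'   V_T'/C'N'   F
1       0.5   -0.564      0           12.6    150.8


This example shows that the cases with radial and tangential
squeeze film velocity can be transformed to ones where $V_T' = 0$.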
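The arithmetic of Example 1 can be sketched in a few lines. The sketch assumes that the scaling factor is 1 − 2V_T'/(πηC'N'), which reproduces the value 3.54 used in the example for η = 0.5; the function and argument names are hypothetical and are not part of the original paper.

```python
import math


def tangential_factor(eta: float, vt_over_cn: float) -> float:
    """Return 1 - 2*V_T'/(pi*eta*C'N') for eccentricity ratio eta."""
    return 1.0 - 2.0 * vt_over_cn / (math.pi * eta)


def to_zero_tangential(eta: float, vr_over_cn: float,
                       vt_over_cn: float, force: float) -> tuple[float, float]:
    """Map a case with V_T' != 0 to the equivalent case with V_T' = 0.

    Both the radial velocity parameter V_R'/C'N' and the dimensionless
    force are divided by the factor above.
    """
    k = tangential_factor(eta, vt_over_cn)
    return vr_over_cn / k, force / k


if __name__ == "__main__":
    # Values of Example 1: eta = 0.5, V_R'/C'N' = V_T'/C'N' = -2, F = 44.7.
    vr0, f0 = to_zero_tangential(0.5, -2.0, -2.0, 44.7)
    print(f"V_R'/C'N' = {vr0:.3f}, F = {f0:.1f}")
```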


Example 2 


Next, consider the case where only radial squeeze film velocity is
assumed, e.g.,

V_R'/C'N'   V_T'/C'N'   F
-0.564      0           12.6    150.8

Find the new dimensionless force and $V_R'/C'N'$ for which
$V_T'/C'N' = -6$.

$$1 - \frac{2V_T'}{\pi\eta C'N'} = 1 + \frac{2 \times 6}{\pi\eta} = 8.65$$

$$F = 12.6 \times 8.65 = 109$$

and

$$\frac{V_R'}{C'N'} = -0.564 \times 8.65 = -4.87$$

Therefore,

V_R'/C'N'   V_T'/C'N'   F
-4.87       -6          109     150.8


Example 2 is more general than Example 1, for it shows that
solutions with only radial squeeze film velocity present can be
transformed to a number of cases where $V_T' \neq 0$. One of these
is the case given by Example 1 and another is the example presented
above.
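The reverse transformation of Example 2 can be sketched in the same way. As before, the factor 1 − 2V_T'/(πηC'N') with η = 0.5 is an assumption inferred from the numbers in the examples, and the function name is hypothetical.

```python
import math


def from_zero_tangential(eta: float, vr_over_cn_0: float, force_0: float,
                         vt_over_cn_target: float) -> tuple[float, float]:
    """Map a purely radial case (V_T' = 0) to a case with the given V_T'/C'N'.

    The radial velocity parameter and the dimensionless force are both
    multiplied by 1 - 2*V_T'/(pi*eta*C'N').
    """
    k = 1.0 - 2.0 * vt_over_cn_target / (math.pi * eta)
    return vr_over_cn_0 * k, force_0 * k


if __name__ == "__main__":
    # Example 2: start from V_R'/C'N' = -0.564, F = 12.6 and target V_T'/C'N' = -6.
    vr, f = from_zero_tangential(0.5, -0.564, 12.6, -6.0)
    print(f"V_R'/C'N' = {vr:.2f}, F = {f:.0f}")
    # The same starting point with V_T'/C'N' = -2 recovers the Table 1 entry.
    vr1, f1 = from_zero_tangential(0.5, -0.564, 12.6, -2.0)
    print(f"V_R'/C'N' = {vr1:.2f}, F = {f1:.1f}")
```

With these inputs the factor evaluates to about 8.64, close to the 8.65 quoted in Example 2, so the results agree with the tabulated values to within rounding.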

It may be concluded that solutions to Reynolds equation,
where $V_T' = 0$, can be transformed to cases where $V_T' \neq 0$, and
vice versa. Thus analyses in which radial squeeze film velocity
is included present the general dynamic solutions to the Reynolds
equation, of which the Sommerfeld solution is only a limiting case
($V_R' = 0$). These general solutions will be presented in a
forthcoming publication.